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THE CRITERION METHOD: 


SOME ANALYSES AND REMARKS 


RICHARD S. BOGARTZ! 


Institute of Child Behavior and Development, University of Towa 


The distribution of trials to criterion under the null hypothesis of no learning is ob- 
tained for 2 broad classes of criteria. Tables are provided which give the maximum 
permissible number of trials which may be run such that the probability of reaching 
criterion when learning does not occur remains less than a selected value. These 


results include Grant's analy 


sis of the “runs” criterion as a special case. Illustrations 


of a number of problems which arise in the use of the criterion method also are pre- 
sented to emphasize the often ignored dependence of method upon theory. 


In the study of learning and transfer, psy- 
chologists frequently employ criteria to deter- 
mine when to terminate the existing treatment 
condition, In most instances the selection of a 
criterion is arbitrary, and although some 
attention has been paid to a number of prob- 
lems which arise in the use of criteria (Ander- 
son, 1963; Hilgard, 1938, 1951; Underwood, 
1954), there is still little in the way of de- 
tailed analysis to facilitate efficient criterion 
selection. A notable exception is ( 
(1946, 1947) treatment of the “ 
terion and the problem of chance 
Grant (1947) pointed out that 
ble for the subject to make 
Sponse by chance, then given enough trials he 
m reach even the most severe criterion, al- 
T ee comet i level does not actu- 
meee taper mata Grant S concern was 
a pret criteria which require a run 
lished saber _ Correct Tesponses. He pub- 
; Slving for various values of n 
and p (the chance probability 
l l y of a correct re- 
sponse), the largest number of trials which 
€ be run such that the Diebabiliry p 
reaching criterior n : ae 
than ae salsted Bie a aea = 

. € present re- 


irant’s 
runs” cri- 
attainment. 
if it is pos- 
a correct re- 


si 


a 


up fond 
tiers he author is indebted to Ellen Brewer, who was 
i Ree for the computer work throughout most of 
helpful Pca, and to Joan Cantor, who made some 
results for Bgestions concerning arrangement of the 


the Class I and Class II criteria. 


port this probability that. criterion is — 
given that the hypothesis of anaes pe 
mance is true, will be referred to as the criterio 
risk (CR). É 

T ie of the CR is general. o 
criterion, for a given value of p, anemi Jh 
CR for each possible number of Pe 
which the experiment can be ere ee g 
say, given a constant chance prol ak a ee 
correct response on each trial, any a 
defines a probability density ey Tea 
therefore a distribution function a n 
attainment over trials. Consequently, Dp A 
be useful to also have tables compart 
jrant’s for other criteria. aeii 
p. the more commonly used ea 
are members of one or the other of si Je 
The first class (Class I) involves nae rt 
based on blocked trials. The morn a 
this class is the criterion of at or foe 
responses in each of L gee ue 
M trials. For example, the ey cor igs 
criterion if he makes at leas aes at 
sponses on each of 2 ere E E 
trials, (K, L, M) = (9, ay 3 eae Ii 
stance would be 10 Bae Bi 1, 10). 
block of 10 trials, (K, L, 4 ) os Po 
The second class (Class at pe ba yee 
ing analysis over unblocked tra A ee W 
rot ce is K correct TERpORS pa PS 
u E trials; for ile rcen us 
sponses in any 10 consecuti 
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TABLE 1 


MAXIMUM PERMISSIBLE NUMBER OF TRIALS AT CRs oF .01, .05, anp .10 vor CRITERIA . 
OF AT Least K Correct Responses IN L = 1 Brock or MV Trials, p = $ 


és iC ae 
Kil 

CR 5:5 6:6 7:7 8:8 7:8 9:9 8:9 10:10 9:10 1l 
01 = — 7 16 = 45 — 100 — a 
05 5 18 42 104 8 234 18 520 40 115 
10 15 36 91 208 16 477 45 1,070 90 = 

10:11 9:14 12:12 11:12 10:12 13:13 12:13 11:13 IEH 13:14 
01 11 — 9 36 — 1,066 65 —s 2,296 re 
-05 88 11 — 192 24 —b 377 52 —» m 
‘10 187 3 á = 396 60 —» 793 117 — 4 

12:14 14:15 13:15 12:15 15:16 14:16 13:16 16:17 1521747 
.01 14 300 30 =a 608 6 — 1% 136 a | 
05 98 1,575 195 30 3,152 384 64 = 731 F 
10 224 — 420 75 > 800 144 —> 1,513 

17:18 16:18 15:18 14:18 17:19 16:19 15:19 18:20 17:20 16: 
01 | 2484 270 36 — 513 76 19 980 140 n 
05 -5% 1692 234 54 2,660 437 95 a 780 : S 
10 —>» 2880 486 108 —> 893 190 — 1,620 

A 


® Criterion risk is exceeded in the first L-M trials. 
è More than 200 M trials may be run without exceeding the criterion risk. 


TABLE 2 


MAXIMUM PERMISSIBLE NuMBER OF Triars aT CRs oF .01, .05, AND 10, FOR CRITERIA 
OF at Least K Correct RESPONSES IN L = 1 Brock or M Triats, p = } 


K:M 
5:5 6:6 7:7 6:7 8:8 7:8 9:9 8:9 7:9 
10 42 147 7 520 24 1,773 90 9 
60 222 784 49 — 152 — 477 54 
125 456 — 105 — 320 —> 981 108 


8:10 7:10 10:11 9:11 8:11 10:12 9:12 11:13 10:13 


20 = 847 77 11 216 24 611 78 
150 20 — 407 55 1,128 156 $ 403 
300 50 —> 836 121-2316 324 mth 819 


12:14 11:14 10:14 12:15 11:15 13:16 12:16 13:17 14:18 


1,708 196 28 525 75 1,376 192 493 1,242 
—> 1,036 168 2,685 420 —) 1,024 2550 ay 
-= 2,128 364 = 870 —b 2,112 —b —b 


re ui ig eded in the first L-M trials. Sa i 
Seco e tials may be run without exceeding the criterion risk, 
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TABLE 3 


Maximum PERMISSIBLE NuMBER OF TRIALS AT CRs oF .01, .05, and 10, FoR CRITERIA 
OF AT LEAST K CORRECT RESPONSES IN L = 1 Biocx or M TRIALS, p=t 


K:M 

CR SeS 6:6 5:6 7:7 6:7 7:8 6:8 8:9 7:9 8:10 
_ SF 

OL 50 246 12 1,148 49 208 16 846 63 2 
05 260 —b 66 —> 266 1,072 96 — 342 1230 
0 535 —> 132 —> 546 —b 192 — 702 = 
7:10 9:11 8:11 Tti 9:12 8:12 10:13 9:13 10:14 11:15 
01 20 869 88 11 300 36 1,027 130 406 1,808 
205 140 —b 473 66 1,560 216 — 663 2100 a 
10 300 — 968 143 —> 444 —> 1,378 = = 


b More than 200 M trials may be run without exceeding the criterion risk. 


seen that Grant’s runs criterion is a member 
of both these classes. In Class I it is of the 
form (1, 2, 1), and in Class II it is of the 
form x correct in any 2 consecutive trials, 

Appendix A gives the derivation required 
for determining the largest permissible number 
of trials which may be run at a given value 
of the CR for a Class I criterion, assuming a 
chance probability of a correct response equal 
to p. The corresponding derivation for Class IT 
is presented in Appendix B. 


Crass I CRITERIA 


To obtain results comparable to those of 
Grant (1947) for the Class I criteria, a pro- 
gram was written for the IBM 7090 which 
generated the CR for each of the first 200 
blocks of trials for various values of K, L, M, 
and p. The M ranged from 5 to 20, K from 
M-—4 to M, and L from 1 to 3. The values of 
p were 3, 3, and }. The program also gener- 
ated the largest number of trials which may 
be run at various CRs for these criteria. Some 
of these results are presented in Tables 1-7. 


TABLE 4 


MAXIMUM PERMISSIBLE NuMBER oF TRIALS AT CRs or .01, .05, AND .10, FOR CRITERIA OF AT LEAST 
K Correct Responses 1x L = 2 Consecutive Brocks oF M TRIALS, p = $ 


K:M 
CR 5:5 4:5 6:6 5:6 7:7 6:7 7:8 6:8 8:9 7:9 
o 3 18 

01 55 — 252 — 1,162 21 72 = 24 5 
05 275 10 =H 30 E 98 344 24 1,233 s 
10 560 20 =i 60 —} 203 712 48 = 1 

O10 8:10 7:40 Ostt O g1 1042 992 $12 11:13 10:13 
OL 890 40 —s 110 = 336 24 — 1,053 65 
05 — 180 20 550 55 1,692 132 “a = ras 
10 — 370 40 1,122 99 = 264 48 = 

9313 11:14 10:14 12:15 11:15 13:16 12:16 13:17 14:18 15:19 i 
01 —s 182 28 510 45 1,440 128 306 ara, B0 
05 52 910 98 2,535 240 — 576 1,496 -e a 
10 91 1,848 210 —b 480 — 1,184 3,060 = 


” Criteri P 3 
v More than aed exceeded in the first L-M trials. 


4 trials may be run without exceeding the criterion risk. 
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TABLE 5 


MAXDUM PERMISSIBLE NUMBER OF TRIALS AT CRs oF .01, .05, AND .10 FoR SELECTED CRITERIA OF AT 


Least K Correct RESPONSES IN L = 2 Consecutive BLocks or M TRIALS, p = $ 


6:10 


6:9 


CR 5 5:6 Sur 6:8 7:10 

Ol B 30 198 42 216 16 1,332 54 270 20 
‘05 | 135 990 189 1,088 64 b 270 1,360 100 
10 | 270 —b 378 —b 120 — 549 —b 200 
ks | 


v More than 200 M trials may be run without exceeding the criterion risk. 


TABLE 6 


MAXIMUM PERMISSIBLE NUMBER OF TriaLs AT CRs oF .01, .05, AND .10, FOR SELECTED CRITERIA OF AT 


LEAST K Correct Responses IN L = 2 Consecutive BLocks oF M TRIALS, p = 4 


356 


CR | 25 4:6 3:7 4: 
.01 210 48 24 434 2 
05 ei 228 84 h 7 
10 => 468 162 — 16 


K:M 
5:8 4:8 06:9 5:9 6:10 
112 a 918 45 270 
568 40 —h 207 1,350 
1,168 72 t 414 —*t 


Thus, Table 1 shows that with a criterion of 
8 out of 8 in one block, when p = 3, no more 
than 104 trials may be run if the CR is to 
remain less than or equal to .05. It also indi- 
cates that with a criterion of 7 out of 8 in one 
block the CR exceeds .01 in the first block, 
but that with a criterion of 11 out of 11 in one 
block, more than 2,200 trials may be run 
without exceeding a CR of .10. 

It will be noted that certain of the more 
severe criteria are omitted from Table 1 for 
the larger values of M. For example, when 
M=15, values are given only for K=12, 13, 
and 14, but not for K = 15. These values 
are omitted because the CR remains less 


TABLE 7 


SUMMARY TABLE OF CRITERIA WHICH 


ARE “SAFE” FOR L = 3 
K 
P y m-i -3 W-3 M-A 
a 8 11 14 17 
i 5 6 8 19 12 
r ; 3 7 5 10 


b More than 200 M trials may be run without exceeding the criterion risk. 


than .01 even when the experiment is run for 
200 blocks of Af trials, so that the entries 
would be Footnote b for .01, .05, and .10. In 
general, throughout Tables 1-6, when for a 
given value of M, the largest value of K 
which is given is less than M, all larger values 
of K are described by Footnote b for the 
three tabled values of the CR. 

For purposes of exposition, let a Class I 
criterion be referred to as a “safe” criterion if, 
given a value of p, that criterion may be used 
for 200 blocks of trials without the CR exceed- 
ing .01. The term “safe” should not be inter- 
preted as “necessarily desirable” since in 
practice it will often be advisable to choose 
criteria which are not safe in the sense in 
which the term is used here. 

When L = 3, most of the criteria for which 
values have been obtained are safe. In Table 7 
the smallest values of M are given for which the 
indicated values of K (i.e, M, M—1,...5 
M — 4) constitute safe criteria. Thus, for ex- 
ample, a criterion of M — 2 out of M on three 
consecutive blocks will be safe if M is 11 oF 

greater when p = 3, 8 or greater when p = 3 
and 7 or greater when p = 3. 
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Crass II CRITERIA 


Comparable results for Class II criteria are 
presented in Tables 8-11. For values of M 
greater than 10 an approximation method was 
employed (see Appendix B). The results given 
in Table 8 permit comparison of approximate 
with, exact values. The exact values of the 
maximum permissible number of trials which 
may be run at values of K = M — 2 to M, 
M = 5 to 10, for p = 3, 3, and 3, are shown 
together with the nonzero differences (in 
parentheses) between the approximate and the 
exact values. A positive difference indicates an 
overestimate of the exact value; a negative 
difference, an underestimate. Thus, for ex- 
ample, the exact value for the criterion 8 out 
of 9, when p = 3 and the CR is .05, is 107 
trials; and the negative difference of 6 indicates 
that the approximate value is 101. 


The approximation is quite good for A = M; 
however, this case is of little interest so far as 
approximation is concerned since it is the 
Grant runs criterion for which exact values 
already have been tabled. For K less than M 
the approximation appears adequate for most 
purposes. Within the typical range of trials 
employed in psychological experiments, the 
approximation remains close to the exact value. 
In general, the approximate value departs 
increasingly from the exact value as the exact 
value increases (i.e, as K, M, or the CR 
increases, and as p decreases). It appears that 
for K less than M the approximation is con- 
servative in that it underestimates the exact 
value; however, this remains to be proved for 
the general case. 

In Tables 9, 10, and 11, the approximate 
values of the maximum permissible number of 


TABLE 8 


MAXIMUM PERMISSIBLE NUMBER OF TRIALS 


at CRs oF .01, .05, AND .10, FOR CRITERIA 


or K Correct RESPONSES IN M CONSECUTIVE TRIALS, p = 3,4) t 


M 
b K CR 5 6 7 8 9 10 
01 cia am 7 11 17 28 
M 05 6 10 17 (1) 31 (1) 59 112 (1) 
10 9 16 (1) 31 59 113 (1) 222 (1) 
: 01 aa Aa La 7 8 9 
2 M-i .05 — 5 6 9 13 19 
10 4 5 13 20 (—1) 32 (-1) 
01 —s = R ba 7 8 
M-2) 05 = ase 5 6 8 9 
10 = = 5 7 9 12 
01 7 15 38 105 304 898 
M a 211) 60 173 510 (1) 1,521 = 
10 41 119 35 1,042 (1) = aes 
à 01 —a 5 A p W 13 t 26 (—1) 60 (—3) 
3 i= 05 3 9 19 44 (—2) 107 (-6) 274 (14) 
10 7 15 35 (2) 84 (—4) 213. (-12) $58 (—29) 
01 sa S 5 7 9 14 
M-32 05 3 5 > 11 21 (-1) 42 (—3) 
10 3 5 10 18 (—1) 37 (-3) 80 (-8) 
5 01 17 50 225 834 ai si 
05 73 284 1,125 (1) — = ei 
10 146 (1) 579 (1) 2,306 (1) — = a 
i ‘01 4 ae it) SL 
t |a] 0 9 23 (a1) (a 229(—10) 786 (—32) 2,766 (—102) 
10 15 44 (—2) 130(—6) 465 (—21) 1,607 (-66) > 
‘01 1 4 6 10 22 (-1) 55 (~4) 
2] o5 3 6 13 31(—2)  s4(=6) 248 (~ 19) 
at 10 4 10 23 (—2) 59 (—5) 167(-14) 502 (40) 


“Numbers in p 
eos in parenth 

SE exceeded on Tri 
SK not exceeded by 


m 
ion 
ia} 3,000. 


ps obtained by subtracting e: 


act value from value obtained with ap- 
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6 
TABLE 9 
APPROXIMATE MAXIMUM PERMISSIBLE NUMBER OF TRIALS AT CRs oF .01, .05, AND .10, FOR CRITERIA 
OF K Correct RESPONSES IN M CONSECUTIVE TRIALS, p = 3 
M 
K CR il 12 13 14 15 16 17 18 19 20 
.01 12 17 24 37 59 101 179 325 602 1,126 
M— 1| .05 29 47 80 140 252 461 855 1,597 
-10 51 88 154 277 505 934 1,740 
01 9 11 12 15 19 25 36 54 85 140 
M-— 2| .05 12 16 22 32 49 77 126 213 368 646 
10 17 24 36 55 88 144 244 421 738 1,309 
01 8 9 11 12 14 15 18 22 29 38 
M — 3| .05 9 11 13 16 21 27 38 54 82 128 
10 10 13 17 22 30 43 63 96 151 245 
.01 7 8 9 11 12 14 15 17 19 22 
M—4| .05 8 9 11 13 14 17 20 25 33 44 
10 9 10 12 14 17 22 28 37 51 73 


Note.—Empty cells denote approximate values greater than 3,000. 


trials for various values of K and M are given, 
for p= }, 3, and 4, respectively. Approxi- 
mate values were not obtained if they ex- 
ceeded 3,000. The only value of M — 1 out 
of M which did not exceed 3,000 when p = } 
was 1,882 for 10 out of 11. 


REMARKS 


Some remarks may facilitate the use of these 
results. Other more general remarks are also 
included which may give the investigator 


reason to reconsider the desirability of using 
the criterion method. 

Running Analysis. When the same values of 
K and M are used in both the running analysis 
and the blocked trial method (Z = 1), the 
former always entails a greater criterion risk 
than the latter (and increasingly so as / 
increases). The direction of this difference has 
been known; these results permit evaluatio? 
of the magnitude of the difference. It can b? 
seen that in terms of the permissible number 0 
trials, the difference can be sizable. 


TABLE 10 


APPROXIMATE Maximum PERMISSIBLE NUMBER OF TRIALS at CRs oF .01, .05, AND .10, FOR CRITERIA 
or K Correct RESPONSES IN M Consecutive TRIALS, p = 4 


M 
x | GR 11 12 13 14 15 16 17 18 19 20 
01 142 374 1,012 2,781 
1—1] .05 690 1,868 
10 1,408 
‘oO 23 46 102 243 607 1,557 
-2| 05 84 195 475 1,191 
á 10 164 391 965 2,434 
at 11 15 24 43 87 192 449 1,089 2,698 
3| 05 22 40 80 172 391 923 2,232 
w= io 37 73 153 342 791 1,882 
‘ol 9 11 13 17 26 43 81 169 374 865 
ae 11 16 25 483 81 165 355 798 1,842 
meg o 16 25 42 77 154 325 715 1,623 
k p rity cells denote approximate values greater than 3,000, 


No 
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An even more striking result for the running 
analysis is the difference in the permissible 
number of trials between the case when K = M 
and that when Ķ is less than M. For in- 
stance, when the chance probability of a cor- 
rect response is one half, and a CR of .05 
is adopted, the permissible number of trials 
with a criterion of 10 out of 10 is 112; with 
a criterion of 9 out of 10 it drops to 19! 

CR and Error Rates. The CR may be thought 
of as the Type I error rate in the test of the 
chance performance hypothesis; that is, the 
— s concluding that the subject is 
isa mat oe than chance when in fact 
oi ors here is a corresponding T: ype II 
the ols ie probability of concluding that 
arias ‘a P os) sb probability of a correct re- 
enter E : shige level when in fact it is 
Feal hee : = Type II error rate will depend 
initial T a of the criterion, the subject’s 
tie ability of a correct response, his 

ni re rate, and the selected value of the CR. 
ails ee circumstances the present 
the Type I lg nay not only in setting 
Type Irate pa but also in controlling the 
college sadenie wie considerations, such as 
50 mihütes en being available for only 
willin » OF preschool children being un- 
villing to remain in m ori al situ: 
tions longer than aes experimental situa- 
mine the ie 0 minutes, may deter- 
be rum in a i aer of trials which can 
maximum number S manag a Wren this 
be possible to : Pi known, it will usually 
criteria switch, a ge among several 
criterion which n A e desired CR that 
reduce the Type IT es R and thus to 
tables will be tele og rate. This use of the 
extent that the Dai y appropriate to the 
m concluding that i aenter 1s only interested 
better than chance = ce is performing 
the subject’s perfor has little interest in 
of the criterion, mance level at attainment 

Criteria and Training-Ty 
Often the experimenter is ie cl Paradigms. 
hypothesis of chance E Gee. artie 
in “equating” subjects whose temia a rather 
ance levels would differ if they ime re 
for the same number of trials. Such t alt zun 
differences could be due, for example Pah 
ences in initial performance levels, dienes 
M parameters governing the rate of angen 


TABLE 11 


APPROXIMATE MAXIMOM PERMISSIBLE NUMBER OF 
Triats at CRs oF .01, .05, AND .10, FOR 
CRITERIA oF K Correct RESPONSES IN M 
CoxsecuUTIVE TRIALS, $ = $ 


M 
Kicr| 1 12 13 14 B #16 17 
01| 148 464 1,525 
\f—2) .05 | 717 2,328 
10 |1,463 
01} 26 60 162 477 1,476 
\—3| 05} 97 267 783 2,389 
‘10| 190 539 1,598 
w| 11 18 33 73 190 534 1,596 
ur—4}io5| 25 52 125 328 918 2,669 
‘10| 43 97 245 662 1,872 


Note.—Empty cells denote approximate values greater than 
3,000. 
the performance level over trials, or both. The 
logic of this method of equating subjects can 
be exemplified with the linear learning model 
of stimulus sampling theory. Assume that one 
response, A1, is reinforced on each trial of the 
training stage of an experiment. The linear 
model asserts that for Subject «, 


pen =1— (L— Pa) (t 0), 


where pz,» is the probability of occurrence of 
A, on Trial 2, pz, is the initial probability of 
that response, and 6: is the learning rate 
parameter. Also let pz, denote the prob- 
ability of A1 for Subject « following the ter- 
minal trial of the training stage. 

If all of the subjects are run for the same 


number of training trials there will be a dis- 
tribution of terminal values over the group 
of subjects. The variance of this eee 
will depend upon the variance of the distribu- 


tion of rate parameters and that of the dis- 
tribution of initial probabilities. For ihe pi 
poses of this discussion 1t will aiee Sei 
upon the distribution of 6 values ani oe 

that all subjects have the same value i h 
This will correspond to the context of “fast 

and “slow” learners in which the use of criteria 


is most often discussed. 


The general problem 0 ne 
is analogous to the problem of reducing 


“atributi . In 
variance of the distribution of pi Lay ie 
terms of the model, the criterion ae a8 pr 
attempt to accomplish this by adjustm 


f equating the subjects 
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the number of trials a subject receives to his 
rate parameter so that subjects with high 
values of 0 (fast learners) are run fewer trials 
than subjects with low values (slow learners). 
Since pz, depends upon both 8+- and upon 72, 
the number of trials that Subject x is run, 
the variance in the distribution of p; values 
will depend upon the variance of the distribu- 
tion of @ values, the variance of the distribution 
of n, and upon the covariance of with 8. The 
basic premise of the criterion method is that 
termination of training at attainment of the 

criterion produces a covariance of n and 6 
which more than offsets the variance added to 
the distribution of p: by variation in x. 

Certain aspects of the criterion method 

become somewhat more obvious when the 
method is viewed in these terms. In particular, 
if all of the subjects have the same learning 
rate and the criterion method is used, in 
general they will not all reach criterion on the 
same trial, just as subjects who are not learning 
(i.e., performing at the chance level) reach 
criterion at different trials (see Appendices A 
and B). (This is because a portion of the 
variability in the number of trials to criterion 
is produced by the criterion method itself and 
has nothing to do with differences in learning 
rates.) As a consequence, the criterion method 
introduces variance in the distribution of pe 
due to variance in n, which cannot be offset 
by covariance of n with @ since all subjects 
have the same value of 0. Thus the criterion 
method actually will produce greater varia- 
bility in the true performance levels than would 
be obtained if all subjects had been run for 
he same number of trials. 

An obverse implication also follows from the 
fact that even when homogeneity of learning 
rates exists, subjects will reach criterion at 
different trials: If a criterion method is used 
and it is found that some subjects reach 
criterion sooner than others, it does not follow 
that heterogeneity of learning rates exists In 
he group of subjects, since this same finding 

would be expected even with homogeneity of 
learning rates. This implication may have 
serious consequences so far as interpretation of 
experiments in which subjects have been sepa- 
rated into fast and slow learners on the basis 
of their performance as measured by the num- 
þer of trials to criterion. SOEST 

Separation of Subjects by the Median Trial 
| The implications of using the 
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median trial to criterion to divide a group of 
subjects into fast learners and slow learners 
have not been treated in the psychological 
literature. Some of these implications are not 
so obvious as others. Many rest upon the 
previously mentioned fact that just as there is 
a distribution of trials to criterion in the case 
where subjects are not learning, there will also 
be a distribution when they are learning, and 
that this holds whether or not the group is 
homogeneous with respect to learning rates. 

In the following discussion it is assumed for 
illustrative purposes that the learning function 
for Subject x is bzn = 1 — .5(1 — 0)". 
Thus each subject has an initial probability 
of a correct response equal to .5, and there- 
fore the subjects may only differ in their 
learning rate parameters. It is also assumed 
that an overall group of subjects comprises 
two subgroups having the same number of 
subjects, and that all subjects in the same 
subgroup have the same learning rate; that 
is, the subgroups are homogeneous. 

Now, given the learning function and @ 
Class I criterion to which the entire group of 
subjects is to be run (eg, LA, L, MJ 
= [1, 6, 1]; a criterion of six consecutive cor’ 
rect responses), it is possible to compute thé 
probability of reaching criterion on the mth 
trial, P(C,). It is then possible to obtain the 
expected value of the probability of a col 
rect response on the first postcriterion trial) 
E(p:), by a summation of products, 


E PCA- 31 — o. 


Thus an £(p,) can be computed for each of the 
two subgroups comprised by the overall group. 
These two values will be referred to as the True 
Slow E(p,) and the Truc Fast E(p,) wher 
computed for the subgroups with the slowe! 
and faster learning rates, respectively. 

Two other expectations of interest can als? 
be computed. First the median trial to criterio? 
for the overall group must be obtained. 
P.(C,) is the probability that a subject in the 
slow subgroup reaches criterion on Trial 1” 
and P;(C,) is the corresponding value for 2 
fast subject, then under the assumption of an 
equal number of subjects in each subgroup, 
the theoretical median trial to criterion for i 
overall group will be that value N whic 
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TABLE 12 


EFFECT OF USING THE MEDIAN TRIAL TO CRITERION TO SEPARATE “FAST FROM “SLOW” LEARNERS 


Cited Method of 
separation Learning rate pairs in overall group 
N 01-01 .01-.03 .01-.05 01-07 01-.09 
Slow Fast Slow Fast Slow Fast Slow Fast Slow Fast 
6 True 663 663 3 75 
: 3 3 663 756 663 806 663 840 663 865 
Median 119 602 749 672 747 720 743 759 730 791 
8 True 730 730 730 817 730 860 730 888 7350 909 
Median 789 668 797 751 787 803 773 883 765 870 
10 True 781 781 781 859 781 895 781 919 781 936 
Median 836 725 832 808 5 861 809 893 803 915 
.03-.03 .03-.05 .03-.07 .03-.09 .03-.11 
Slow Fast Slow Fast Slow Fast Slow Fast Slow Fast 
6 True 756 756 756 806 756 840 756 865 756 885 
Median 824 696 840 729 840 761 37 781 83. 806 
= T 
8 True 817 817 817 860 817 888 817 909 817 925 
Median 873 157 883 790 8 826 874 853 809 872 
10 True 859 859 859 895 859 919 859 936 859 948 
Median 907 807 914 844 906 869 899 895 895 913 


Note.—Decimal points omitted. 


minimizes 


N 
IZ [P.(Cx) + PCa) = 1- 
L x 
n ind 0; be the two learning rates. Then, 
se the E(p,) for those subjects in the 
ite all group who reach criterion on or before 
median trial, the “fast” subjects, will be 


X PCE — 3 — 0)" 
+ P(C) — 51 — 6)" J). 


Thi z 5 
T which will be referred to as the 
da MEE E (pı), is the expected probability 
‘aL do i response on the first postcriterion 
the fact yl labeled “fast” by virtue of 
Ai mea at he reached criterion on or before 
“slow? sor Ag The corresponding value for 
obtained b jects, the Median Slow E(p,), is 
Th Table summing from V + 1 to œ. 
True a 12 the four E(p,)’s, True Slow, 
eek eo Slow, and Median Fast, 
ing rate pair or each of 10 different learn- 
S at each of three different cri- 


teria. The criteria are 6, 8, and 10 consecu- 
tive correct responses. It will be noted that 
two of the learning rate pairs, .01-.01 and 
.03-.03, have identical values for the slow 
and fast subgroups. These are included to 
indicate the effect of using the median trial 
when the entire group is homogeneous. The 
tabled values illustrate a number of desirable 
and undesirable features of the criterion 
method in general and also of the median 
trial separation procedure. Four of these will 
be indicated. 

First, inspection of the True values indicates 
that for a given learning rate, the E(p.) in- 


creases as the criterion becomes more severe. 


For example, with a 0 value of .01 the True 
781 for the 


E(p:) values are .663, .730, and 
criteria of 6, 8, and 10 consecutive correct 
responses, respectively. This is in accord with 
the intuitive expectation that the harder the 
criterion the higher will be the average per- 
formance level when it is achieved (if learning 
is taking place). 

Second, it can be seen that for 
criterion, the True E(p:) values go up 


a given 
as the 
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learning rate increases. Thus for the criterion 
of six consecutive correct responses, as the 
learning rate ranges from .01 to .09, the True 
Fast E(p,) ranges from .663 to .865. This 
exemplifies the point made by a number of 
authors that the criterion does not exactly 
equate subjects who are learning at different 
rates, but rather the faster learner, on the 
average, will be somewhat above the slower 
learner on the first postcriterion trial. Al- 
though it is the case that the faster the 
learning rate the higher the E(p,), it does 
not follow that the greater the difference in 
learning rates the greater the disparity in 
E(p:)’s. For example, it can be seen that for 
the criterion of six consecutive correct re- 
sponses, a difference of .06 between the 0 
values .01 and .07 results in a difference of 
177 between the values .663 and .840, but 
a difference of .08 between the @ values .03 
and .11 results in a difference of only .129 
between .756 and .885. This is due to a ceiling 
effect. What can be said is that given a slow 
group with a certain learning rate, the greater 
the difference between that given rate and the 
learning rate for a fast group, the greater will 
be the disparity in their Æ (p+) values. 

Third, it can be seen that some of the Median 

values are inverted. When the learning rates 

for the two subgroups are identical (both .01 

or both .03) the Median Slow E(p:) is greater 

than the Median Fast £(p,). Thus when both 
subgroups have 0 values of .03, the Median 
Slow and Median Fast values are .824 and .696, 
.873 and .757, and .907 and .807, for the three 
criteria, respectively. The implication is 
obvious: If a group of homogeneous subjects 
is run to criterion and then divided on the 
basis of the median into fast and slow learners, 
the slow learners can be expected to behave as 
if they learned more than did the fast learners. 
And, on the average, they did. . 

It will also be noted that the inversion also 
can occur when the two subgroups differ in 
their learning rates. The instances of inversion 
have been underlined in Table 12. The pat- 
tern suggests certain other generalizations 
but these will not be enumerated here. ; 

The fourth feature of Table 12 indicates 
what has probably been the intuitive basis 
for the use of the median trial to criterion as a 
separating point. This is that as the learning 
rates for the two subgroups move farther and 

farther apart, the Median E(p,) values con- 


verge to the True values. 
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This finding of a possible inversion of the 
Median Slow and Median Fast E(p;)’s, the 
occurrence of which depends upon a number of 
parameters, appears to be of considerable 
significance with respect to interpretation of 
experimental results. Suppose, for example, 
that the median trial to criterion on a training 
task is used to separate subjects into fast and 
slow learners, and that on a transfer task the 
fast learners perform more poorly than do the 
slow learners. Under the assumption that the 
terminal level for fast subjects is higher than 
that for slow subjects, one fairly natural con- 
clusion would be that learning the first task 
produces interference on the second. The 
results presented in Table 12 indicate quite 
clearly that such a conclusion is completely 
gratuitous. It may actually be the case that 
learning of the first task is unrelated to per- 
formance on the second or that it is eve? 
facilitative, since it may actually be the slow 
learners who on the average have learned more 
in training. Without information concerning 
the learning function and the existing rate 
parameters (which if available would eliminate 
the need for a criterion) as well as the effects 
produced by the particular criterion selected, 
it is not clear how to interpret differences be- 
tween the fast and slow learners on a transfer 
task. 

“x Percent Overlearning” Procedure. Anothe! 
commonly used procedure is to give a subject 
“overlearning” trials, the number of which 
depends upon the number of trials the subject 
takes to reach criterion. Thus in the 100% 
overlearning procedure, if the subject reaches 
criterion on Trial z, he is given another » trials; 
in 50% overlearning he is given an additional! 
n/2 trials; etc. 

It has already been observed that the more 
homogeneous the group of learners the more 
will the criterion method inflate the variance 
of their terminal performance levels as com” 
pared with an identical group of subjects wh? 
are all run for the same number of trials, a? 
that this inflation is exactly the opposite of th? 
goal of using the criterion. Now consider ” 
group of identical subjects who are run to ie 
criterion. Some reach it early; some latë 
Because they are identical so far as their rat? 
parameters are concerned, the ones who rea” 
criterion early will have lower terminal pe 
formance levels than will those who reach i 
late. If the x%-overlearning method is a y 
ployed, the subjects who reach criterion eat 
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Fic. 1, The probability of reaching a criterion of 10 consecutive correct responses 


on or before Trial x, given the learning curve pn = 1—.5 (1 — 08)". 


ora fewer additional trials than will 
settee sd reach it late. Thus the overlearning 
tot n ape the injustice so far as the 
one T er of trials is concerned. Whether 
ofthe wn Produces an even greater inflation 
ee ae of terminal performance levels 
onesie ii ee the form of the learning 
used. Wh nd the particular criterion which is 
effets wil = subjects are not identical the 
baming vat be a upon the distribution of 
theo as well as on the function and 
paana da seston problem involved in this 
oes ete üppase that some criterion method 

vao? F (at least roughly) the performance 
eit rates. An of subjects learning at differ- 
Sete T 1 the overlearning procedure pre- 
Spread oat equating or will the subjects be 
numbers of Pa by the addition of different 
Upon the fo rials? In general, this will depend 
let £(0 rm of the function. As an example, 
can adi =1— = pd = 0). t 
wn that if £(,, 2.) = £@,, 7s), then 


f(9, ia ; 
»(\1+—— )n,) is approximately equal 


tof(@ ee 
fy (1 + am) ny). This means that in 


the 
case í i 
of the linear learning model, once 


subje; x 

entinted amning at different rates have been 

Preserve tha «%-overlearning procedure will 

amount), ne Sduating (except for a negligible 

learning modd surely this will not be true of 

fact that thet in general, and in view of the 
orm of the learning function is 


almost always in question, careful considera- 
tion should be given to the desirability of 
using this method. 

Inference of the Performance Level from 
Attainment of the Criterion. The solid curve 
in Figure 1 is the learning curve fn =1 
— .5(1 — .03)”—. The dashed curve is a dis- 
tribution function which gives the probability 
that a subject having this learning function 
wil] have reached a criterion of 10 consecutive 
correct responses by the nth trial; that is, it is 
the integral of the density function which gives 
the probability of reaching criterion on Trial 7, 
given this learning function. If a large group of 
subjects were all learning according to the 
given learning function, the dashed curve 
would give the expected proportion of subjects 
reaching criterion on or before Trial n. 

The two curves can be used to determine 
confidence intervals. For example, a line 
drawn parallel to the abscissa from .10 on 
the ordinate will intersect the dashed curve 
at the trial upon which 10% of the subjects 
can be expected to have reached criterion. 
In Figure 1 this occurs at Trial 25. A line 
drawn parallel to the ordinate from this point 
of intersection will intersect the solid curve. 
The value of pn at this second point of inter- 
section is then read on the ordinate to be 
about .76. Ten percent of the subjects will 
have a value of pn no greater than .76 on 
the trial they reach criterion. Tt may also be 
said that 90% of the subjects will have a 
value of pn greater than or equal to .767 (the 


———— 


12 


60 


RICHARD S. 


BOGARTZ 


= 50 
Z 
3 / 
2 aot a-/ 
40 Z — A 2-5 Ul-.01) 
30 f —— Probability of reaching 
criterion on or before 
aor oft ri triol n 
1Or 
ae 1 f fi ‘ 1 fi ii 1 fi fi 4 
10 20 30 40 50 60 70 80 90 100 IIO 120 130 140 150 
Trials n 


Fic. 2. The probability of reaching a criterion of 10 consecutive correct responses 
on or before Trial z, given the learning curve pa = 1 — .5 (1 — 01)», 


value of pa on Trial 26) on their first post- 
criterion trial. 

In Figure the learning curve pn = 1 
— .5(1 — .01)""! is plotted together with the 
corresponding distribution function for the 
same criterion of 10 consecutive correct re- 
sponses. It can be seen that with this learning 
curve 90% of the subjects who reach criterion 
will have a value of pẹ greater than or equal to 
.679. About 50% of the subjects will reach 
criterion before pa reaches .80, whereas in 
Figure 1 only about 18% reach criterion before 
pn reaches .80. 

These results illustrate the fact that the 
distribution of terminal performance levels 
obtained through the use of a criterion will 
depend not only upon the criterion which is 
used, but also upon the learning function and 
the parameters characterizing the group of 
subjects. Since the learning function and 
parameters are unknown, it will be difficult to 
infer the range of performance levels which 
has been achieved. This is not to suggest that 
it is impossible to put criteria to such use, but 
only to indicate that without additional in- 
formation to be used in conjunction with the 
selected criterion, it is difficult to support any 
conjectures as to the terminal performance 
level resulting from the use of a particular 
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criterion. 

Concluding Comments. When the criterion 
method is used to test the hypothesis that no 
learning has occurred, there are no implicit 
assumptions concerning the form of the learn- 
ing function or the learning rates present In 


the population of subjects. This is the case be- 
cause the alternatives are no learning versus 
learning of any sort. However, when the goal 
is to equate the terminal levels of a group © 
subjects, then it appears that additional in- 
formation is required if one is to choose between 
using a fixed number of trials and a criterion. 
It is not sufficient to rest usage of a criterion 
upon the unlikelihood that all subjects learn 
at the same rate. It must be shown that the 
difference in rates makes a difference, and this 
should not be done by indicating that some 
subjects reach criterion before others. 

There are instances where the criterion 
method is the most natural way to proceed. 
For example, in Theios’ (1963) study of 
avoidance conditioning, the model he used 
assumed subjects are eventually absorbed into 
a state in which only correct responses occur. 
He used a criterion of 20 consecutive correct 
responses. The problem is that this case is an 
exception. In far too few instances is the 
model or class of models and the implica- 
tions for use of the criterion method made 
explicit. 
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aalo I consists of the criteria in which S 
vi hes criterion by making at least K correct 
agen on each of L consecutive blocks of M 
thes oe CR for i blocks of M trials will be 
b. agg that criterion is reached on or 
ices i ith block. To obtain this distri- 
qos unction it will suffice to derive the 
ad function and then cumulate it. 
Kon a bability of exactly © correct re- 
ies in any block of M trials will of course 
ave the binomial distribution 


M! 


B(x, M, p) = apt 


(= pi. 


Th abil; 
i € probability of at least K correct responses 
simply 


M 
È d(x, M, p) 


z=K 


which is d 
eno i 4 

and $). ted by B (suppressing K, M, 

I 
il pd ot the event Cj, attainment of 
on Blodet : hia 
ish obese: ck i, to occur, three conditions 

Í: Base 
bach a so K correct responses are made on 
$ (thi the L consecutive blocks i — L + 1 to 

A ae with probability B”). 
made = wk K — 1 correct responses are 
a i ock i — L, otherwise criterion would 
with en reached on Block 7 — 1 (this occurs 

h Probability 1— B). 
n ; 

ie E first 7 =L — 1 blocks, no more 
than K — {ping blocks contain more 
asserts th correct responses. This simply 
the first = criterion has not been reached in 
that it oes be — 1 blocks. (The probability 
function been reached is the distribution 


i-L-1 


PG-L-1)= E Pr(C); 
i=l 
therefor, 
e 
reached ae Probability that it has not been 
Ising then, t = — 1). 


Product rule for the probability of 


the joi 
Nt oc 
c . 
Urrence of independent events, we 
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obtain the recursive expression 
Pr(C) = [1 — Fü- L-1) -— B) B” 


for the probability density function. 

For L = 1 it is easily shown by induction 
that the probability density function is the 
geometric distribution, Pr(C;) = B0 — B^; 
with mean B~ and variance (1 — B)B™. The 
CR is 


i 

Fi) = X BU -= Bit =1-— (1 B). 

j=l 

Thus, if we plot, as a function of blocks, the 
probability that S has reached criterion on or 
before that block, we obtain for L=1 a 
negatively accelerated curve with asymptote 
at unity and with B as the rate parameter—a 
“learning” curve without learning, so to speak. 

The value of B for particular values of K, 
M, and p is easily obtained with a desk 
calculator or tables. It is then a simple matter 
to obtain the largest ¢ such that F(i) is less 
than or equal to a selected CR. 
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A Class II criterion is of the form K correct 
responses in any M successive trials. With this 
type of criterion a running analysis is made 
over trials 1 to M, 2 to M+41,3 to M+ 2, 
etc., until M successive trials occur which 
contain K successes. The first such occurrence 
constitutes attainment of the criterion. 

For n < M the probability of Cna attain- 
ment on Trial 7, is the probability of a suc- 
cess on Trial n and K — 1 successes in the 
n — 1 previous trials. We have then 


nck 


(n—1)! : a-K 
PCY =a HEY z 
K<nsM 


For n > M the problem is somewhat more 
an be shown, however, that 


r the hypothesis of chance 
h trial, a Class II criterion 
h one absorbing state | 


complicated. It c 
for n > M, unde 
performance on eac 
is a Markov chain wit! l à 
(see Feller, 1957, Ch. XV, for a discussion of 
Markov chains). Three steps are required to 
show this: a set of possible states of the chain 
must be defined, an initial (Trial M) proba- 
bility distribution for these states must be 
specified, and a matrix of transition proba- 
bilities p;x must be specified. This matrix 
which gives for all j and k the probability of the 
system being in State k on the next trial, given 
that it isin State j on the present trial, must be 
independent of the trial number. 

To define the states of the chain w 
the 2” different sequences of zeros 


e first 


consider 


———————— 
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and ones (corresponding to errors and correct 
responses), 111... 11, 111 oa 10, ww ny 
000 . . . 01, 000 . . . 00, which are each of 
length M. Setting aside the question of 
criterial performance, we may observe that at 
each step in the running analysis of a response 
protocol we have in hand one of these 2¥ 
sequences, and that given that sequence, only 
two sequences are possible at the next step. 
For example, if M is 5 and we observe the 
sequence 00110 at Step z, then at Step n +1 
we shall observe either 01101 or 01100. The 
initial digit on the left is dropped (we move 
forward one trial), and either a 1 or a 0 is 
added on the right, depending upon the out- 
come of the next trial. We observe 01101 if a 
Correct response occurs or 01100 if an error 
occurs. 

Thus if there were no interest in a criterion, 
each of the 2” sequences could constitute a 
state in a Markoy chain. Each row of the 
transition matrix of this chain would consist 
of all zeros except for two entries which would 
be p and 1 — $, the probability of a correct 
response and an error, respectively. 

To incorporate the criterion into this ap- 
Proach we combine the 


M M! 

S= arar n 
= iar —y)! 
Sequences having K or more l’s into a single 
absorbing state, A. The remaining 2” — 5 
states are transient, Definition of these 
2@ — S + 1 states completes Step 1. 

The initial probability of being in State 4 
is simply the probability of reaching criterion 
on or before the Mth trial 


M 


(x — 1)! i 
2a- pU — pjk, 


The initial probability of the system starting in 
a transient state which contains i Usand M — i 
0's is simply pi(1 — p), 

The transition matrix of this chain consists 
of one row with the first entry equal to 1 and 
the remaining entries equal to 0, plus 2¥ — 5 
rows having all 0’s except for two entries which 
are p and 1 — p. For example, the criterion 2 
out of 4 would have as its vector of initial 
probabilities and its transition matrix 


A 1000 0100 0010 0001 0000 
a A ji 0 0 0 0 0 
pU- p)? 1000/0 0 0 p 1-p 
ae —ż)? | and 0100|p 1—p 0 0 0 0 
p(l—p)® 0010 | b 0 1—p 0 0 0 
p0 —p)3 0001; p 0 0 1-p 0 0 
(1 —p)t o000;0 0 0 0 bp 1-p 


where a = 1 — 4p(1 = H- 1 - b)i. 
Clearly the probability of reaching criterion 
on Trial M + t is simply the probability that 


the system enters the absorbing state on 
Trial £ The initial probabilities for the 
transient states and two sets of recurrence 
relations given in Feller (1957, p. 363) deter- 
mine the probability of absorption for cach 


value of ¢. ; 
We now develop, in the following para- 
graphs, the approximation method used to 


obtain the results for values of M greater that 
10 (Tables 9-11). This method is used because 
the number of states in the transition matrix: 
24 — S+1, increases exponentially with M 
giving transition matrices that rapidly become 
unmanageable even with a computer. The 
manageability problem is eliminated by using 
an approximate distribution function iq 
absorption (i.e., attainment of criterion) as j 
function of trials. The derivation of this ak 
proximate distribution function rests upon oe 
Approximation Assumption that the probe 
bility of being in a Particular transient stat® 
given that absorption has not taken place 
remains equal over trials to the initial proba 
bility of being in that state. t 

Let F(n) approximate the probability tha 
absorption takes place on or before the nth 
trial of the process, It therefore approximate! 
the probability of criterion being reached on 0 
before Trial M +n of the experiment. Fof 
absorption to occur on trial n+ 1 of the 
Process, three conditions must obtain: 

1. Absorption has not occurred previously 
(the Probability of this is 1 — 

2. The system on Trial » of the process must 


K — 1 1's in the remainin 
which denote K — 1 correc 


the approximation assu i 


is eimai 
(1 — p)b(K = 1,1 po e tmp 


occurs on Trial n +! 
ability of this is p). 

he approximate proba 
is absorbed on Tri 


of the process (the prob. 

We have, therefore, t 
bility that the system 
n + 1 of the Process 


Fn +1) — Pln) = DA = F(n)), 


where D = (1 — pac — 1, M — 1, p). TH 
solution to this difference equation in F(n) i$ 


F(n) = f — (i = F(0)) (1 — D)”, 
where F(Q) is 
bility that the 


state (i.e, tha 
before Trial M 


the previously obtained proba” 
system starts in the absorbi? 
t criterion was reached on ? 
of the experiment) 


(Received August 20, 1964) 
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USE OF THE LIKELIHOOD FUNCTION IN INFERENCE? 
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Universily of Waterloo 


The likelihood function is defined and its use illustrated by a simple coin-tossing 
experiment. The distinction between the use of the likelihood function and the use of 
a test of significance is emphasized and illustrated by a simple genetics example. 
Some examples are given of experiments in psychology where the likelihood function 
is used to analyse the resulting data; the relative merits of the use of likelihood com- 
pared to other more standard methods of analysis are discussed. 


The importance of the likelihood function 
has been emphasized for at least 40 years by 
Fisher (1922, 1925, 1934, 1946), and more 
recently specific examples of its use have been 
given (Barnard, Jenkins & Winston, 1962; 
Birnbaum, 1962; Fisher, 1955). In fact there 
is a “likelihood school” of statistics which 
says that all the information of the sample 
relative to the population under consideration 
is contained within the likelihood function and 
inferences should be based on it (Barnard et al., 
1962; Birnbaum, 1962; Fisher, 1955; Sprott, 
1961). The Bayesians (Edwards, Lindman, & 
Savage, 1963) also emphasize the importance 
of the likelihood function, but the use they 
make of it is different from that in this paper. 
Although the use and importance of the likeli- 
hood function are now being discussed in the 
statistical literature, there seems to be very 
little use or recognition of it in outside fields 
such as psychology. The purpose of this paper 
is to describe the likelihood function and its 
use in statistical inference. Section I defines 
and discusses the likelihood function and rela- 
tive likelihood function and illustrates its use in 
inference by a simple example. Section II com- 
Pares the likelihood approach with the some- 
what similar, but logically distinct significance 
test. Section ITT gives an example from psy- 
chology of inferences based on the likelihood 
function from the data that would be difficult 
or impossible to assess accurately and rigor- 
ously in any other way. It is pointed out that 
perhaps the main drawback to the use of like- 
lihood as a measure of uncertainty is that it is 
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not probability and so is not intuitively obvious 
to some people. It has however a definite 
interpretation in terms of probabilities, and it 
should also be noted that very few methods of 
inference are capable of making probability 
statements about hypotheses or unknown 
parameters. The test of significance and con- 
fidence intervals certainly do not. 


SECTION I: LIKELIHOOD 


The use of likelihood is illustrated in the 
following example: a coin which turns up heads 
an unknown fraction, 0, of the time is tossed 
10 times in order to obtain information about 
8. Before the experiment is performed, the 
probability of observing « heads in 10 tosses 
for any specified value of 8 is given by the 
binomial distribution 


fæl = ti 6=(1 — 9) 
% 

and can thus be calculated numerically for any 
x. After the experiment a particular value 
x = xo has been observed. We can now use 
f(xo]0) as a function of 0, employing the p 
served value, æo, to rank possible values of 
according to their plausibilities. Suppose be 
xo = 1 head was observed. For any given the 


probability of the observed result is 


£(1|0) = (?)ea — 6. 


For example, if 0 = .1, the probability of ob- 
serving one head is 


f(1].1) = (7) (1) (9)? = .387, 


and if 0 = .5, the probability of observing 


[i] 
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one head is 
TIDE (?) (.5)(.5)? = .00977. 


Thus under hypothesis H: 6 = .1 the result of 
the experiment is 


387 
| 
-00977 


times more probable than under H: 0 =.5, 
The data favor the hypothesis 0 = .1 40 to 
1 over the hypothesis 0 = .5. This is ex- 
pressed by saying the likelihood of 0 = .1 is 
40 times that of @ = .5, or the likelihood of 
0 = .1 compared to 0 = .5 is 40. Since hy- 
potheses under which the observations are 
relatively probable will be considered more 
plausible and hence preferable to those under 
which the observations are relatively improb- 
able we can use the function f (110) to compare 
the plausibilities of different hypotheses in the 
light of the observation x = i. 

In general, before the experiment, f(x|6), 
considered as a function of the observation æ 
for a specified value of 9, yields the probability 
of observing x; after the experiment, f(x|0), 
considered as a function of the unknown vari- 
able 6, and where x is set equal to its ob- 
served value, gives likelihoods of possible 
values of 6; f(x|0) is called the likelihood 
function of @ and can be denoted by L(6|x). 
Thus, before the experiment probabilities are 
relevant; after the experiment likelihoods are 
relevant (Barnard, 1949), 

As can be seen from the above example, the 
likelihood is used to compare the plausibilities 
of different hypotheses under a given set of 
observations, so that only likelihood ratios are 
meaningful. In fact the likelihood ratio 


L(6;|x) 
L(62| <x)’ 


[2] 


which is the plausibility of ð = 6, versus 
0 = bz, is the ratio of the probability of the 
observations if @ = 6, to their probability if 
0 = bz. Since only ratios are used, the likeli- 
hood may be multiplied by any factor C(x) 
independent of 0, for the ratio 


C(x)L(61| x) _ Lila) 
C(w)L@2|") — L (6/2) 


remains unchanged. Thus in the above ex- 
ample the likelihood function for + = 1 can 
be taken to be 


a(1 — 6)9, [3] 


Since it is desired to rank values of the 
unknown variable 8 according to their plausi- 
bilities, special attention will center on the 
most plausible value or maximum likelihood 
estimate Ê of 6; the likelihoods of all other 
values of Ø can be compared with that of the 


most plausible value 0 by means of the likeli- 
hood ratio 


R(6|x) = L6|x)/L(6| a). [4] 


R(@|x) varies between zero and one and gives 
the relative plausibility of 6 compared to the 
most likely value Ø = 6 in the light of the 
observations. Any two values of 0 can be com- 
pared by taking the ratio of their relative 
likelihoods, for 


ROl LOLO _ Lela) 
Rola) ~ L(@e|x)/Z@|a) ~ Llay 


which by Expression 2 is the plausibility of 
6=6, versus 0 = 6. In the first example, 
where x = 1 is observed, the maximum likeli- 
1 


hood value 6 is 10: inserting this into Expres- 


sion 3 gives the maximum of the likelihood as 


“s(-8) (8): 09 


and the ratio L(1|6)/L(1|6) of Expression 3 
and Equation 5 is 

= 91-8) _ a(t — ooo 
R(0|1) = Ti\ ono =% ar [6] 


10/\ 10 


By direct substitution into Equation 6, the 
values R(6|1) can be calculated for various 
values of 8. Some results are given in Table 1. 
In Figure 1, R(0|1) is plotted as a function of 
9. All values of 6 between .01 and .35 are fairly 
plausible. Values of 9 smaller than .001 or 
larger than .5 are suspect, in that there are 
available hypotheses (e.g., 0 = -1) which are 
40 times as likely. Values of 6 as large as «7 
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are obviously extremely implausible in the 
light of the observation x = 1 and can be dis- 
regarded, as there are hypotheses under which 
our observation is 1000 times as probable. A 
discussion of a similar example will be found 
m Fisher (1955, pp. 66-73). 

It can thus be seen that R(6|x) is a measure 
of uncertainty or plausibility applicable to an 
unknown variable @ for given experimental 
observations x. Like probability, R(8|x) varies 
between 0 and 1. Likelihoods are not probabili- 
tes however and the distinction is an important 
One to preserve. Likelihoods do not obey the 
rules of combination of probabilities. Although 
this is unimportant when only a single un- 
known variable, 0, is present, it becomes 
troublesome when there are many unknown 
Variables, 6,-- “On. 


Section II; LIKELIHOOD AND 
Test OF SIGNIFICANCE 


in a Standard or classical method of examin- 

frets e foregoing example would be by means 

Nific e test of significance. To test the sig- 

x oe of 0 = .5, the probability of observing 

eq = than or equal to 1 or greater than or 
lual to 9 is calculated: 


O 


10 
+(10)(3) = 
10 2 

en TH: = 5 is significant at the .021 level. 
can be noted that the relative likelihood for 

is 0356 tained in Section II for this example 
an atte (Table 1). The test of significance gives 
in ,_cthative way of examining the evidence 


this ex Š 
Xample. Sever: t e noted 
however. ple. Several points should b 


signi either the likelihood nor the test of 
ability ance is capable of making any prob- 
Nown Statements about hypotheses or un- 
esis wiih Oe Under the latter, an hypoth- 
does ch is significant exactly at the 5% level 

not necessarily have a 5% probability 


>» an event as unfavorable as the one 
the ed has a 5% probability of occurring if 
Vielg Pothesis is true. The likelihood ratio 
Statements of relative likelihoods of 


TABLE 1 


RELATIVE LIKELIHOODS FOR CoIN-TosstnG EXAMPLE 
IN WHICH x = 1 was OBSERVED 


o R(0/1) 
0 0 
-001 0256 
01 .236 
05 S14 

10 1.000 

15 897 
20 683 
25 486 
30 313 

35 187 

40 105 
45 .0535 
50 0255 
55 .0107 
50 00406 
65 .00133 
.70 .000346 


hypotheses; these are logically different from 
probabilities. 

2. The test of significance makes no 
logical distinction between certain values not 
observed and the actual value observed. For 
instance, in the example the value « = 0, which 
was not observed, is utilized in the same way 
as x = 1, which was observed. Such tests have 
been criticized on this ground (Fisher, 1955). 
The likelihood, on the other hand, utilizes only 
the value of x actually observed, implying that 
other values of x not observed are irrelevant. 

3. The likelihood function can be used 


Rr (|D 


vur ow oy ovo 


2345 6 7 8 9 10 
e 


a 


c. 1. Relative likelihood for coin-tossing example, 


s x = 1 head observed. 
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R (0) 


-ruranan vo 


© 


-54 55 56 57 58 59 60 6! 62 63 
o 


Fic. 2. Relative likelihood function for genetics example. 


only to compare the plausibility of one hy- 
pothesis with that of another in the light of the 
data, In this sense the likelihood shows quanti- 
tatively the direction in which the weight of 
the evidence is pointing. The test of significance 
shows whether a specified hypothesis is con- 
sistent with the data. It should be noted that 
some hypotheses consistent with the data may 
still be relatively implausible, Indeed, the fact 
that an hypothesis is not significant merely 
means that no convincing evidence against the 
hypothesis has 
but there may be much more 
which the relative likeli- 
example can show the 
a test of significance and 


Example: If there are only two forms, M 
and N, of a gene at a given locus or place on 
the chromosome, then the genotypes MM, 
MN, NN are expected to occur with relative 
frequencies 


P, 20— D; = 6), E 
respectively, if mating is at random with 
respect to these genes, where 0 is the relative 
frequency of the M gene in the population? 
(cf. Li, 1955). Using the observations cited 


2 Tf the relative frequency of the M and N genes is 
8 and 1 — 4, respectively, then the probability of having 
two M genes is 62, of having two N genes is (1 — 0), 
and of having an M and an N gene in either order is 
20(1 — 0). This assumes that the acquiring of the M 
and N genes are independent events like the tossing 
of a coin, 
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by Li (1955, p. 13) in which the frequency of 
the MM, MN, and NN blood types in a sample 
of 1,029 humans was observed to be 342, 500, 
and 187, respectively, the distinction between 
the use of a test of significance and the use of 
the likelihood can be illustrated. (a) Use of 
significance test—The significance test will 
indicate whether the Statistical Model is 
Consistent with the data at all. The probability 
of observing 342, 500, and 187 is given by the 
trinomial distribution with relative frequen- 
cies given by Model 7: 


Pr(342, 500, 187) = — $1029! 
Benes aB ~ 3421500! 187! 
X [20(1 — 6) mr (1 — 0p 

1,029 ! 2500 


(62) 


ouss(—9) 


243 1500! 187 ! 
Thus the likelihood of 0 can be taken to be 
gusa ox 6) 874 


[8] 


which attains its maximum at @ = 6 = .5753. 
Using this value for 0, the expected = 
quencies of MM, MN, NN in a sample 
of 1,029 are, by the theoretical frequencies i? 
Model 7, 1,029(.5753)?, 1,029(2)(.5753) (4247) 
1,029(.4247)" which are 340.6, 302.8, and 185.6: 
Comparing these with the observed frequencies 
342, 500, and 187 by means of a chi-square 
test, and hoting that ldf has been used tO 
estimate the most likely value of @ (cf. Fishers 


observing a chi-square 
larger than .032 is 85%, there is no evidenc? 
that the Model 7 is inconsistent with thes? 
data. (b) Use of the likelihood—since the 
overall model above can be accepted as ap” 
Propriate, the likelihood can now be used to 
rate the relative plausibility of values of 4 
The maximum of the likelihood can be found 
by substituting 9 = ĝ — 95753 with Expres- 
sion 8 to give 


(5753)"*(.4247)30, 


The likelihood ratio obtained by dividing 
Expression 8 by its maximum in accordanc® 
with Equation 4 is 


RO) = ousy — 8)°/ (.5753)"5( 447)304, 


and this measures the plausibility of @ rela 
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tive to the most likely value @ = 6 = .5753. 
In Figure 2, R(@) is plotted against 8. Values 
of @ outside the range (.54, .61) can be dis- 
regarded as the data overwhelmingly favor 
values inside’ the range. 

Had the test of significance in @ above 
yielded an improbably large value of xX, the 
entire Statistical Model 7 would have to be re- 
jected as incompatible with the observations. 
Then the use of the likelihood as above would 
be irrelevant; it would be meaningless to ask 
about the relative plausibilities of values of 8 
m a model that was demonstrably incompatible 
with the data. 

The test of significance can therefore be used 
to test the validity of the overall model, and, 
if the model is accepted as appropriate, the 
likelihood can be used to rate the relative 
Plausibility of the unknown variables in it. 
The likelihood in fact indicates the direction in 
Which the weight of the evidence is pointing 
Once the hypothesis as a whole is found 
acceptable, 


Section III: AN EXAMPLE FROM 
PsyCHOLOGY 


The following data arise from an experiment 
by Banks and Vogel-Sprott (1964). The ob- 
Servations are the number of punished re- 
Sponses in 15 opportunities for response under 
four different delayed punishment times of 0 
Seconds, 30 seconds, 60- seconds, and 120 
Seconds, The results can be presented in a 
frequency table (Table 2) in which an entry 
1i; in the body of the table gives the observed 
Tequency of subjects in the ith delay-time 
Stoup who gave j responses. 
weeeetfic questions under consideration are 

nether the number of responses depends on 
dai, delay-of-punishment time and whether 

ay times 30, 60, and 120 differ among 
“a and from 0 delay. An analysis 
eh g number of responses as a continuous 

*tmal variate is questionable because the 
ata are discrete and the distribution is 

Shaped and truncated at 15. A more ap- 
frobriate procedure would be to use the above 
Gan table which contains counted (multi- 
fa ial) data. The chi-square test of signifi- 
leg ge be used because of small frequen- 
Age he table could be condensed and classes 

mbined, but much information would be 


TABLE 2 


FREQUENCY OF PUNISHED RESPONSES 


Number of Responses 


Delay Time} 1 2 3 4 5 6 7 8 
Oseconds | 2 3 0 2 0 2 0 0 
30seconds | 0 1 1 1 0 Q o 1 
6Oseconds |0 1 0O 1 0 1 1 0 
120 seconds | 0 1 oo f © © 0 
Total 2 6 1 4 0 3 1 if 

9 10 11 12 13 14 15 |Total 
Oseconds |0 O 1 0 0 O OO); 10 
30seconds |0 0 0 2 0 O 4] 10 
60seconds |0 0 0 0 0 1 5 10 
120seconds |0 0 0 0O 1 2 6j 10 
Total 0 0 1 2 1 3 15 40 


lost. An extension of Fisher’s (1946) exact 
test of significance for 2 X 2 tables could be 
used in principle, but even with a high speed 
computer this would not be feasible. Thus it is 
difficult to test whether the above hypotheses 
are actually inconsistent with the data (see 
Section II, 3). The use of the likelihood how- 
ever is computationally simple and can throw 
some light on which way the experimental 
evidence is pointing. 

Suppose pi; is the hypothetical probability 
that a person makes j responses given delay 
time i (i = 1-4, j = 1-15). The hypothesis 
of independence between number of responses 
and delay time is mathematically 


Hh: [10] 


The actual frequency of j responses under 
delay time i is m; (Table 2). The above data 
then follow multinomial distributions, and the 
likelihood function is the product of all pis", 
which for notational convenience can be 


written as 
JI pa. [11] 
ij 


pij = p; independent of i. 


This example is somewhat more complex than 
the preceding examples, as the likelihoods of 
particular values of pij are not in question; 
rather we are concerned with the likelihoods 
of sets of p,; satisfying Equations 10. In order 
to see if Hı is reasonable, we can evaluate 


| 
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TABLE 3 


FREQUENCY OF PUNISHED RESPONSES 


Number of Responses 


Delay Times 


1-5 6-10 11-15 Total 
0 seconds 7 2 1 10 
30 seconds 3 1 10 
60 seconds 2 2 6 10 
120 seconds 1 0 9 10 
Total 13 5 2 40 


the likelihood of the most likely set of values 
pij satisfying Equations 10, that is, maximize 
Expression 11 subject to Equations 10 and 
evaluate the relative likelihood of the resulting 
numbers p;; = fj. If the most likely values of 
pi satisfying Equations 10 are unlikely, then 
all values satisfying Equations 10 are unlikely, 
so that H; itself is to that extent implausible. 

The maximum likelihood values pis = Pi 


E : Nj 
satisfying Equations 10 can be shown to be %4 


n 
where 7; is the total of Column j and wis the 


overall total. Substituting p;; = 6; = "4 into 

n 
Expression 11 using the observations nij in 
Table 2, the resulting likelihood is 


(30) Gn) ENEN (in) Gs) 
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nothing to the products. The relative likeli- 
hood of the most likely set of values p;; satis- 
fying Equations 10 is extremely small sp ihi 
Hı, even though it might not be contradicte! 
by the data, is a relatively implausible hy- 
pothesis certainly not favored by the data. 
The evidence is pointing strongly away 
from Hy. ; 

It may be thought that the delay times 
differ in respect of the rather erratic responses 
in the middle of the scale, that is, between 
and 10. If responses 1-5, 6-10, 11-15 are con” 
bined, the result is Table 3. The hypothes!= 
Hipi; = pj, can be tested in the same ways 
and the relative likelihood of the most likely 
values of p;; under H; is 

10491 388552 


——— — = 2.2% 10%. 
404°772236522226 599 

It can be seen that even under this less sens 
tive classification the evidence does not fav 
the hypothesis that delay time produces no 
effect. 

The hypothesis H» that delay times 30, 601 
and 120 produce the same effect but possibly 
differ from 0 delay time is more complicate“ 
to test. Here the hypothesis is 


He pis = pj; i = 2,3, 4 only. 


The most likely values of pi; satisfying Ha C4” 
be shown to be 


x (A) 4/27 17 3 3715) 
40 40 40 40 /\ 40 40] ° 

The maximum of the likelihood is obtained for 
py = = where 7. is the total of row i; this 


i 


can be written, substituting in Expression 11 


nij nij 
mC) 
The likelihood ratio or relative likelihood of 


"OE: 
pi = 


two expressions, which, using the entries in 
Table 2 is 


=I] mT am [12] 


above is therefore the ratio of these 
n 


10402266443 32233 1 5! à 


LOMPIPPPLSIpG T 9 X 10%, 


ignoring 0’s and 1's, since they contribute 


a Mj a 
Py = —, å 


nj 
n 


= Ig 


= pi 


zi = Paj = Pas = 

N — My 
The likelihood of these v. 
obtained by substitutin 
using the numerical values in Table 3; th® 
maximum of the likelihood is again foun 
from Equation 12. The likelihood ratio arising 
from the results in Table 3 is ther 


alues of the p’s can bê 
g into Expression 1 


efore 
10969332 1217792 
fui = C 
3001007723162269 ~ 098. 


Thus the evidence favors Ha, the equality of 


delay times 30, 60, and 120, over Hy, th? 
equality of delay times 0, 30, 60, and 120, about 
446-1 (- __ 098 
2.2 X 104 
is itself fairly plausible, so that it is reasonabl® 
to conclude that delay times 30, 60, and 12 
produce essentially the same effect, the 0 tim? 


) Also the hypothesis Bs 
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being responsible for the main differences 
observed. 

The above tests exemplify the use of the 
likelihood in tables with entries too small to 
be attacked conveniently in any other way. 
Of course these tests, like the standard sig- 
nificance tests, are valid only if the hypotheses 
tested were considered relevant before the 
experiment was performed ; they are not valid 
if the hypotheses tested were suggested by 
Inspection of the data. 


ANOTHER AREA OF APPLICATION 


Section TIT dealt with an example in which 
the use of the likelihood appears to be the most 
convenient method of attack. A system in 
Which there mav be feedback of an unknown 
kind provides an example where the use of the 
likelihood function is the only method of 
attack, For instance, an observational variable 
Yı may be normally distributed about a mean 
Bx, with constant variance o°, Where s is an- 
other observational variable at time t. If the 
X are independent of all ys standard regression 
analysis applies. If there is feedback of the 
form t= Yn standard regression theory 
does not apply and the analysis using Sig- 
nificance tests and probability (fiducial) in- 
tervals becomes complicated although still 
Possible, Tf, however, there is feedback of an 
unknown form M = g(ya) where g is an 
unknown fanietion; possibly independent of 
Yea (ie., no feedback), then significance tests 
and probability intervals cannot even in prin- 
ciple be obtained as the required distributions 
will not be known. This kind of situation may 
Possibly arise in experiments in which learning 
takes place. The likelihood function however 
will not depend on knowing the form of the 
unction g(y,_;); all that must be known is the 
numerical value of x; = ge): Whatever the 
rm of g(y,1), the likelihood function based 
= a sample of n pairs of observations: May Yay 
Ta, Yo, +. ‘En Vn is 


(Y e|- a Eo- sr), 


whic ER : 
hich attains its maximum at 


Bafa Dim oo gal dor he 


96 
34 
92 
s0} 
88 

Me 
-86 
84 


82 f 


80) 


78 —— 
72 74 76 78 


BO 82 
Si 


84 86 88 


Fic. 3. Contours of constant likelihood 
for the Solomon-Wynne data. 


Thus the relative likelihood can be calculated 
numerically for any pair (8, g). For any value 
B = Bo, one can calculate the most likely value 
of o? (i.e. o? given Bo); if the relative likeli- 
hood of the resulting pair (8o, o°) is small, then 
the value, 8o, itself is implausible, since no 
value of o? will yield a pair (Bo, o°) with high 
likelihood. Examples of this kind are discussed 
in more detail by Barnard et al. (1962). 

A final example of the use of the likelihood 
function is provided by the stochastic learning 
model of Bush and Mosteller (1955, pp. 238- 
249) applied to the Solomon-Wynne experi- 
ment. Using this model, the probability of a 
shock on Trial n when it is preceded by avoid- 
ances on k trials is 


qn = ata" E, 


where a is called the avoidance parameter and 
ag the shock parameter. The likelihood func- 
tion is therefore a product of terms, one for 
each trial, of the form 


atak or 1 — aar t, 

the former occurring in the product on shock 
trials, the latter occurring on avoidance trials. 
Since, in the Solomon-Wynne experiment, 
there are 30 dogs each with 24 trials, there are 
720 factors in the likelihood function, making 
it too difficult to write down or compute on a 
desk calculator. The methods of Bush and 
Mosteller (1955), based on using suitable sub- 
sets of observations, could be used to write 
down an approximate likelihood function 
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amenable to a desk calculator. However, it is 
relatively simple to program a high speed 
electronic computer to calculate exact likeli- 
hood values, using all of the data to obtain 
the maximum of the likelihood, and by Equa- 
tion 4, the relative likelihood. This was done, 
and the results are shown graphically in 
Figure 3 in which contours of constant likeli- 
hood are plotted in terms of a, az, and the 
reference line a1 = a2 is shown. This graph 
represents a concise pictorial summary of 
everything the data have to say about a; and 
az. It can be seen from the elongated shape, 
for instance, that the data throw more light 
on a than on ay; that is, whatever be the value 
of as, plausible values of œ are in the range 
(.73, .840), whereas whatever be the value of 
a, the plausible values of ws are in the range 
(.89, .954)—the extremities of the 1% con- 
tour—the one range being about twice the size 
of the other. Also, the likelihood decreases very 
rapidly for small changes in a, and a, but 
about twice as rapidly for a». Thus, in terms 
of relative likelihood it is important for a; and 
az both to be near the maximum. The region of 
appreciable likelihood is concentrated well 
above the line a, = az so that there is little 
doubt that the evidence points strongly to os 
being greater than a. The contours of constant 
likelihood also indicate how knowledge of one 
parameter will affect an inference concerning 
the other. For instance, if a; is known to be 
.746, the maximum of the relative likelihood is 
.1 and occurs at a = .935, so that values of az 
in the range (.89, .954) now have a relative 
likelihood of at least 01/1 = 1, rather than 
-01 as before. This approach can be compared 
with the more usual large sample approach 
used by Bush and Mosteller (1955) where the 
maximum likelihood estimates are obtained 
approximately as @, = 197, & = 923 with 
asymptotic standard errors of about .04 and 
.015, respectively ; they also give a theoretical 
interpretation of the results, 

It can be seen from this 
examples that the use of 
several advantages: 

1. It represents a concise summary of all 
of the information in the data, and when 


graphed, it gives a pictorial summary of what 
the data have to say. 


and the preceding 
the likelihood has 
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2. Inferences based on the likehood are | 
exact for samples of all sizes; no assumption | 
or mathematical investigation is required to 
ascertain if the sample is large enough to 
justify the method. DE. 

3. The rather complicated mathematics 0! 
asymptotic or large sample theory is not ae 
sary, giving a relatively simple theoretica 
approach to inference. ; : 

Against these may be set the possible dis- 
advantages of increased numerical computa 
tion in addition to the uncertainty ae 
expressed (exactly) in terms of ae 
rather than (approximately in large samples 
in terms of probability. Also, the ieee 
method is not a test of significance and canno? 
be used to test the validity of a single by 
pothesis without reference to alternatives. 
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This article reviews the result 
the relationships between properti 
and job behavior in business an 
structural properties were examine 
span of control, subunit size, total- 
tralized/decentralized shape. At least 
exceptions being span 
found to be significant! 
iables. Implications of thes 
are discussed. 


es o 


All organizations are structured, in the sense 
of having positions and parts which are sys- 
tematically related to other positions and 
parts, Since organizations vary in their struc- 
ture, it is appropriate to examine the question 
of whether differences in the structure of or- 
Sanizations are related to differences in the 
attitudes and behavior of their members. The 
enon of this article is to review the results 

Published research (prior to 1964) relevant 
ant relationships existing in business and 

ustrial organizations. 
of ough organizations have been the focus 
tine aici and interest in sociology since the 
in png Weber’s first writings on bureaucracy 
she e 1920s, and in industrial psychology 
ney the time of the Hawthorne studies in 
hehe 1930s, it has been only in the past 
atte Or so that intensive and concentrated 

€ntion has been given to organizations in 
po behavioral sciences. The major develop- 

nt in this area in psychology over the past 
« Years has been the advent of so-called 
as os organization theory.” Such theory, 
veloped by Likert, Haire, McGregor, 
Syris, and others, has been advocated as a 


Cc : 

Ontrast with, and improvement on, so-called 
1 
‘ta Or are indebted to the following indi- 
earlier or their comments and suggestions regarding 

versions of this paper: Chris Argyris, Peter 
lau, Edwin E. Ghiselli, Thomas M. Lodahl, 
also ik Strauss, and Victor H. Vroom. We should 
ke to acknowledge the financial support of the 


nsti aeath A 
a of Industrial Relations, University of Cali- 
la, Berkeley, 


Georg 


of control an 
ly related to one or more attitude or behavioral var- 
e findings for organization theory and future research 
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s of empirical field studies that have investigated 


f organization structure and job attitudes 


d industrial organizations. The following 7 
d: organizational levels, line/staff hierarchies, 
organization size, tall/flat shape, and cen- 


5 of these 7 variables (with the possible 
d centralized/decentralized shape) were 


“classical organization theory” as developed 
by Fayol, Urwick, Taylor, and others. Whereas 
the classical theorists, with their discussions of 
chain of command, specialization of function, 
span of control and so forth, placed heavy 
emphasis on factors related to organization 
structure, the modern psychological theorists, 
such as those mentioned previously, have paid 
relatively little attention to the structural 
aspects of organizations. Bennis (1959) aptly 
summed up this point several years ago when 
he pointed out that classical theorists talked 
about “organizations without people,” while 
modern theorists often seem to talk about 
“people without organizations.” Instead of 
emphasizing the structure of organizations, 
latter-day theorists have advocated supervi- 
sory human relations training, T groups, 
attention to the nonfinancial needs of the em- 
ployees and the like, as methods for improv- 
ing organizational administration and leader- 
ship. The efficacy of such methods, however, 
may well be reduced by failure to take into 
account the structural characteristics of or- 
ganizations. 

Not only have organization theorists tended 
to ignore structural variables, but also so have 
many of the researchers in this area. Despite 
the apparent lack of concentrated empirical 
efforts to understand the effects of structural 
variables (other than the variable of work- 
group size), there are, nevertheless, a number 
of studies in the psychological, sociological, 
and management-science literature that are 


w 
xy 
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pertinent to a consideration of structure. Al- 
though many of these studies were not con- 
ceived by their authors as investigations per- 
taining to structure as such, they do provide 
relevant and important evidence and thus will 
be included in this review. 

For the purposes of this Paper, structure 
has been defined to mean the Positions and 
parts of organizations and their systematic 
and relatively enduring relationships to each 
other. Thus, we are referring to the formal 
structure of organizations as might be indi- 
cated, in a superficial way at least, in the 
formal “organization chart.” Within the 
boundaries of this broad definition of struc- 
ture, we have identified the following seven 
structural properties or variables: 


Suborganization properties 

1. Organizational levels 

2. Line and staff hierarchies 

3. Span of control 

4. Size: subunits 
Total-organization Properties 

5. Size: total Organizations 

6. Shape: tall or flat 

7. Shape: centralized or decentralized 
The first four 

sidered as suborg; 


y crucial in the dis- 
‘size.” It is also im- 


: y exclusive. p 
‘span of control” 


ey are typically 
the research and 
ure on organizations, 


In essence, for this review these seven prop- | 
erties of organization structure will be re- 
garded as the “independent” variables. Sing 
almost none of the studies reviewed were es- 
perimental studies in the strict use of the term 
“experiment,” any relationships found be- 
tween structural variables and job attitudes 
or job behavior cannot be considered “cause 
effect” relationships. Thus, neither the sul 
nor our conclusions from them can prove tha 
the variations in structure cause the particular 
attitudes or behavior, However, the structura 
properties are referred to as the independen ‘ 
variables in the sense that if there are ni 
cause-effect relationships present they pro” 
ably are due either to the variation in struc 
ture causing the variation in attitudes or si 
havior, or else to some third variable de y 
mining both the structure and the attitu # 
and behavior. Although it is conceivable th 4 
the kinds of attitudes or behavior usually A 
vestigated in these studies could determin” 
the structure of organizations, this seems pe 
likely (at least in a direct fashion) in mO? 
cases, 3 

Turning now to our “dependent” variables, 
job attitudes and job behavior, we have use 
the term “attitude” in its broadest and loose® 
Sense of “opinion concerning some object, se 
this case jobs or aspects of jobs. Included ie 
der job-attitude studies would be the typi 
morale study, where the specific attitude ques 
tions are concerned with opinions about t? 
organization, the Supervisors, the working CO” 


ditions and other factors directly connect 
with the job situation j 


the heading of job he 
at were (with a few &* 
with the following tyP® 
ormance and output rate 
Fates, absenteeism rate?” 
employee-grievance rate 
the problem of the relationshiP 

des and these various I 
have not been dealt wit! 


ceptions 
ofi 
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these have merely been considered to be two 
categories of dependent variables that are of 
Interest to social scientists studying organiza- 
tions. 

Finally, the three major criteria for the in- 
clusion of studies reviewed can be listed: 

1. Studies of industrial and business or- 
ganizations. The review is purposely restricted 
to studies dealing with these types of organi- 
zations—rather than also including other kinds 
of organizations such as governmental agen- 
TA military units, schools, churches, and the 
ike—in order to keep the discussion reason- 
ably focused and delineated. 

2. Comparative studies. Only those studies 
mai have dealt with at least two (preferably, 
a e more) positions or degrees of struc- 
ia ong a given property or dimension are 
eat Thus, for example, a study of only 
the s ens attitudes would not be included in 
a oe on organizational levels since such 
wiih rd did not compare foremen’s attitudes 
eke hose of employees at some other organ- 
ieee level. Likewise, a study of only small 
tion = would not be included in the sec- 
Paton, studies of total size, since no com- 
wech a within that study are possible be- 
tra organizations and other-sized or- 
of SE ge Under this criterion, case studies 

Single organizations were also excluded. 
fon. Empirical studies. With very few excep- 
ite only those studies providing empirical 
mostly ¢ included. Articles based wholly or 
Fortea a ‘the opinions of the author unsup- 
Gil ston actual data have been excluded. In 
ete H articles based on casual observations 
could i seemed unlikely that the procedures 
iie e easily repeated by other researchers 

A been excluded. 
or Pi of the review to follow are ar- 
isted te same order as the previously- 
structure Pane properties of organization 
Permitting aus, we Dret take up properties 
rganization comparisons, fol- 


ow + 
ene by properties concerned with total-or- 
ization comparisons. 


ORGANIZATIONAL LEVELS 


Üi pes many years social scientists interested 
a noa organizations have tended either 
else to ¢ he effect of organizational levels or 

oncentrate on the simple dichotomy 


between managers and workers. In the 1940s 
most textbook writers in the area of human 
relations, following the lead of the Hawthorne 
investigations, spotlighted the plight of work- 
ers vis-a-vis managers in terms of the types 
of physical, social, and psychological condi- 
tions they had to endure. The emphasis was 
almost completely on the worker-management 
comparison. This tradition carried over into 
the 1950s when the “modern” management 
theorists, such as Argyris (1957), Haire 
(1956), Leavitt (1958), Likert (1961), and 
McGregor (1960), continued to focus, for the 
most part, on the relationship of managers 
to nonmanagement subordinates. 

This relative lack of attention to the effects 

of differentiation among levels within manage- 
ment is somewhat surprising when one con- 
siders the amount of interest that industrial 
psychologists and sociologists have shown in 
the effects of division of labor. Almost always, 
however, division of labor has been studied as 
it occurs along a horizontal dimension at the 
rank-and-file worker level. The fact that there 
is not only a division of labor horizontally 
but also vertically within industrial organi- 
zations has frequently been neglected. The im- 
portance of paying closer attention to this 
vertical dimension of structure has been well 
stated by Pfiffner and Sherwood (1960) when 
they point out that: 
[The] differentiation of task between echelons is of 
more significance to the selection and training of 
leaders at the several levels than may be indicated 
by the attention accorded it in the past. The psycho- 
logical adjustment necessary when one goes from one 
level to another is often difficult because of the 
tendency to continue former behavior patterns ©... 
At first glance this might appear to be a problem of 
human relations rather than of formal organization, 
but such a conclusion would be only partially true. 
Good job descriptions should reflect task differentia- 
tion at the various echelons. It is a matter of tasks 
combined with behavior . . . [p. 139]. 


In surveying the literature pertinent to or- 
ganizational levels and their impact on, or re- 
lationship to, job attitudes and behavior, we 
encounter several difficult boundary or defini- 
tional problems. The first such problem con- 
cerns the question of whether studies of occu- 
pational levels should be included in the re- 
view. For our purposes, since the focus is on 
organizational structural variables, such oc- 
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cupational level studies will be , generally 
omitted on the basis that they ordinarily in- 
vestigate levels in a societal or cultural set- 
ting rather than levels within an organiza- 
tional setting. A second problem in defining 
studies of organizational levels is posed by the 
investigation directed toward superior-subor- 
dinate relationships. Although in one sense 
these are studies that compare two different 
organizational levels, they in fact seldom pro- 
vide data that contribute to our knowledge 
of the effects of the vertical dimension of 
structure. This is because such studies are 
usually concerned with implications for supe- 
rior-subordinate authority relationships and 
not with implications for attitudes or be- 
havior that may be differentially required or 
exhibited at various job levels within organi- 
zations. Most such studies, therefore, will not 
be covered in this review. 

Within these broad limits, this section of 
the review will analyze the results of studies 
containing two or more organizational levels 
as the independent type of variable and one or 
more categories of directly job-related atti- 


tudes or behavior as the dependent type of 
variable, 


Attitudes 


Job Satisfaction and Morale. The first 
studies to be considered will be those that 
compare management with nonmanagement 
employees; after that, studies dealing with 


comparisons among levels within management 
will be reviewed. 


Herzberg, Mausner, Peterson, and Cap- 
well (1957) have summarized the literature 
through 1954 relevant to job satisfaction atti- 
tudes associated with organizational levels, 
They conclude that “one unequivocal fact 
emerges from the studies of job satisfaction; 
the higher the level of occupation, the higher 
the morale [p. 20].” It is clear from this 
statement that their review included studies of 
occupational level. The problem of drawing 
conclusions from these studies is that fre- 
quently the occupational level categories in- 
clude people who do not work in industrial or 
business organizations (e.g., professional peo- 
ple included in a category of “executive and 
professional”). Nevertheless, the Herzberg 
review does cite five studies “in which the 


morale of workers was compared to that of | 
their supervisors (Ash, 1954; Browne & Neit- | 
zel, 1952; Campbell, 1948; Hull & Kolstad, 
1942; Kolstad, 1944).” The study by Browne 
and Neitzel was actually based on different 
supervisory levels and will thus be reviewed 
below. The other four studies, however, did 
show greater job satisfaction among super- 
visory personnel when compared to workers. 

In a study not included in the Herzberg 
review, Morse (1953) reported on the satis- 
factions of 61 supervisory personnel compared 
with 600 workers in a single company. She 
found that: 


The supervisors are considerably more satisfied with 
their jobs and with the company as a place to work. 
They are somewhat less satisfied than the employees 
with their salaries and are about equal in satisfactio” 
with the employees regarding the advancement they 
have received in the company [p. 98]. 


Thus, Morse’s conclusions are in general 
agreement with the other studies cited above 
although, as she points out, first-level super- 
visors may not be as satisfied as the rank and 
file with the specific aspect of wages. 

Since the publication of the Herzberg 1e 

view, only one study (Handyside, 1961) has 
appeared that has focused on management 
versus nonmanagement differences in job satis: 
faction. This study involved 30 managers an 
Supervisory personnel and 467 production 
workers and found that “median satisfaction 
(toward jobs) was higher for managerial pe! 
sonnel, 
In recent years the content of job satisfac 
tion studies of different organizational levels 
Seems to have shifted from an exclusive focus 
on comparisons of Supervisory personnel vel’ 
sus rank-and-file workers to comparisons 
among different levels of management withi? 
organizations, 

A major point at 


state that: 
of management often have very 
(p. 23].” Their review refers to two articles 
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ree but neither article presents enough 
po pe the samples of respondents or 
hime ae to draw any meaningful conclu- 
basist us, there seems to be no substantial 
sis in the literature prior to 1954 to support 

this conclusion. 
Pee auies, plus one appearing prior to 
mous N erg review, seem to be nearly unani- 
totale i that job satisfaction or 
arkasina Te increase monotonically with in- 
lara s of management, and that there- 
heta e managers are more satisfied than 
satisfied es them in the organization but less 
study, b oi those above. The first such 
that bial rowne and Neitzel (1952), found 
vision B sg be for three levels of super- 
level of a positively related to the echelon 
articles e supervisors [p. 90].” In 1961 two 
doses Sone on different levels of man- 
196 1a) gi (Porter, 1961; Rosen, 
with et re s study, which was concerned 
sented = action with conditions of work, pre- 
within ndings for three levels of management 
manager single plant. He found that the top 
gers did S the plant and the middle mana- 
Other į not differ significantly from each 
hanai satisfaction, but both groups of 
an ted were significantly more satisfied 
however -level supervisors. Rosen also noted, 
of satis E that there was a similarity of profiles 
ieee oi among the three levels of man- 
Order co, such that there were high rank- 
etween relations (ranging from .67 to 93) 
egree ges pair of levels in terms of the 

n P satisfaction with the 24 items. 

amaS study (1961), two levels of 
i nent, first-line supervisors and lower- 
different, anagement, were surveyed in three 
naire de types of companies with a question- 
Des aoe to tap satisfaction of different 
archica] Ha relevant to a Maslow hier- 
Showeq eao of needs. The results 
Ment eg the higher of the two manage 
action ad generally reported greater satis- 
Yestigation as was the case in Rosen's 1m- 
ended to pe pattern of need satisfactions 
ubedi similar between the two levels. 
anq aoe to these 1961 studies by Porter 
investionth: Porter has completed a large-scale 
; Toss et of managerial need satisfactions 
evel ieee of management, from first- 
tvisors to company presidents. This 


ee Bap r Aer 
hous gers from a wide vari- 
ety of different types and sizes of companies 
located throughout the country. The results 
were in general agreement with those from 
the previous Porter and Rosen studies. Satis- 
faction increased with each higher level of 
management for three of five needs (esteem 
autonomy, and self-actualization needs) and 
profiles of satisfaction were generally similar 
across levels. (For security and social needs 
satisfaction was roughly equal across all levels 
of management.) It should be emphasized that 
satisfaction in this study was measured by the 
difference between obtained and expected ful- 
fillment. This is important because it means 
that for at least three types of needs lower- 
level managers were not just getting less 
fulfillment, as might be obvious, but there was 
a larger difference between their expectations 
and their fulfillment than was found at upper 
levels of management. 

Much of the same sort of results were re- 
ported in a study published by the Opinion 
Research Corporation (1962). Attitude data 
were collected from 1,200 managers showing 
that the percentage of unfulfilled needs de- 
creased from lower to middle to top manage- 
ment. For almost all of the specific questions 
asked of the respondents there were no re- 
yersals wherein a higher level of management 


expressed greater dissatisfaction than a lower 


level. 
Finally, in a study as yet unpublished in 


detail, Haire, Ghiselli, and Porter (1963) 
found that in a cross-cultural investigation of 
managerial attitudes in 14 countries, higher 
levels of management on the whole reported 
greater degrees of need satisfaction in their 
jobs than did lower-level managers. It thus 
appears that the phenomenon of increased 
job satisfaction with increasing level of man- 
confined to the situation exist- 
n companies but tends to be a 
of industrial organizations. 
priefly at this point, it can 
degree of assurance that 
the available literature on job satisfaction 
across different levels of organizations shows 
ob satisfaction at each higher 
true both for nonmanagement 
comparisons and also for 


agement is not 
ing in America 
worldwide fact 

To summarize 
be stated with some 


increasing J 
level. This 1S 
versus management 
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comparisons within the management parts of 
hierarchies. In addition, one other finding 
appears with some consistency: Patterns of 
satisfaction are roughly similar across dif- 
ferent organizational levels, at least within 
management. 

Other Attitudes. A number of studies have 
been concerned with the relationship of organi- 
zational levels to attitudes other than degree 
of satisfaction. However, such studies are 
somewhat difficult to classify and summarize 
since they cover a great diversity of types of 
attitudes. 

There are, nevertheless, several groupings 
that are reasonably discernible. The first of 
these comprises studies of the relative im- 
portance of different types of needs and 
motives at different organizational levels, Two 
studies (Fortune, 1947; Kahn, 1958) have 
made direct comparisons between nonmanage- 
ment and management levels with regard to 
the importance of different motivating factors. 
The two studies seem to present somewhat 
conflicting results for these comparisons. 
Kahn’s findings indicated that both workers 
and their supervisors attached high importance 
to security, but that supervisors attached less 
importance to high wages than did the work- 
ers. The Fortune study, on the other hand, 
indicated that when security was pitted 
against high wages in an “cither-or” choice 
situation, high-level executives chose the high 
wage situation. It is obviously difficult to com- 
pare the two sets of findings since both the 
samples and the procedures differed. What is 
clear is that we need considerably more evi- 
dence from a wide variety of types of organi- 
zations before we can make any broad general- 
izations concerning manager-worker differ- 
ences in the importance attached to different 
motivating factors, 

Several studies in the past 5 years have 
made comparisons of the importance attached 
to different needs and conditions of work 
among different levels within management. 
The first of these studies was by Rosen and 
Weaver (1960) who studied the degree of 
importance attached to 24 conditions of work 
by three levels of management within one 
plant. They found high rank-order correla- 
tions (ranging from .76 to .90) between each 
pair of levels with regard to the degree of 


EDWARD E. LAWLER, III 


importance attached to each of the 24 items. 
In terms of overall importance, the first-level 
supervisors attached generally more impor- 
tance to most of the items. Subsequent to the 
Rosen and Weaver study, Porter published 
two articles dealing with the importance at- 
tached to different psychological needs by 
different managerial levels (Porter, 1961; 
1963a). Both studies confirm one aspect of the 
findings of Rosen and Weaver; namely, dif- 
ferent managerial levels tend to be similar 
with regard to what they consider most im- 
portant. However, the later and more compre- 
hensive of the Porter studies differed in one 
respect from both the earlier Porter study and 
the Rosen and Weaver study. It showed that 
higher-level managers tended to attach some- 
what more rather than less importance to 
various needs, whereas the earlier studies in- 
dicated that higher managerial levels regarded 
most needs or conditions as less important 
compared to lower levels. The difference in 
these findings may lie in the fact that in the 
later Porter study the degree of formal educa- 
tion of the respondents was about equivalent 
across management levels, while in the earlier 
studies the degree of education was either not 
controlled or else the lower-level respondents 
had less education. It is conceivable that em- 
ployees with less education may be more prone 
to consider everything as important, whereas 
those with greater formal schooling may be 
somewhat more discriminating in attaching 
importance to different needs and conditions. 
Therefore, it may be necessary to control for 
differences in formal education when com- 
paring different management levels with re- 
gard to the importance attached to different 
psychological needs or conditions of work. 
Finally, the cross-cultural study by Haire, 
Ghiselli, and Porter ( 1963) showed that for 
almost all of the countries in their sample 
there was again a strong similarity between 
upper-level and lower-level managers in the 
relative importance attached to different needs- 
Turning to a different area of job attitudes, 
We find a small group of studies that can be 
put under the general heading of role percep- 
tion studies (Coates & Pellegrin, 1957; Por- 
ter & Henry, 1964a; Rosen, 1961b: Triandis, 
1959b). Although each of these studies was in 
some way concerned with role perceptions, it 
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cog be noted that they were addressed to 
er different aspects of role perceptions, 
ey utilized different types of samples, and, 
net they employed quite different 
Triandis to obtain their data. For instance, 
heave : (1959b) utilized the semantic dif- 
Béice A technique to study the “differential 
eer am of certain jobs and people” by 
pa ee managers, lower-level managers, 
He 2 and workers from one company. 
nae cl uded that “the most significant 
Gt ie, ich the perception of jobs is the level 
adii h [being perceived; p- 224].” In 
nee a a also found that managers made 
tons tet A eE among their job percep- 
gated ‘se id workers. Rosen (1961b) investi- 
a partic > ee levels of management in 
scriptions ar company ranked 16 “role pre- 
agersiat T His findings showed that man- 
at in h e two lower levels tended to be simi- 
pra “ed they evaluated their role as super- 
a con each level differed somewhat from 
entry est of the three levels. Porter and 
mana, Bite: studied the perceptions of 
evel i T in terms of the type of behavior each 
eir 6 ieved was most important for jobs at 
ig oe level, and found that at each 
Dhasis evel of management increasing em- 
Posed Pig put on “inner-directed” (as Op- 
Coates © “other-directed”) personality traits. 
diea a Pellegrin (1957), in a study 
ound th primarily toward self-perceptions, 
tended ra a supervisors and subordinates 
and sac A be aware of the rewards, penalties, 
ih Ra ices associated with high-level roles. 
achievern awareness differentially influences 
220)» ent desires on the two levels [p- 


on overall look at these diverse studies of 
evels re eption among different organizational 
Stems rao that again the variable of level 
Ypes of eve a strong relationship to these 
tions of Perceptions, just as it did to percep- 
wò een id job satisfaction. 

Ofial stru estigations have compared the fac- 
ersonne] Ictures of attitudes of lower-level 
1954) e those at higher levels. Baehr 
Sociates actoring the Science Research As- 
‘hi (SRA) Employee Inventory for a 

8hb-statu. 39 ploy: y . 
Collar s” group of managers and white 
of OP Sorinel and a “low-status” group 
ty workers and routine clerical per- 
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sonnel, found considerable, but not com- 
plete, similarity in the factors emerging from 
the two groups. What differences there were 
indicated that the factors of the high-status 
group tended to be more concerned with the 
overall functioning of the organization while 
those of the low-status group were more rele- 
vant to specific aspects of the immediate work 
situation. Triandis (1960) factored the se- 
mantic job structures of managers and super- 
visors in a manufacturing plant and compared 
these with the factored job structures of work- 
ers in the same plant. He found even greater 
factorial similarity between the two groups 
than did Baehr and concluded that: “Man- 
agement and workers, one might expect, will 
differ in their factorial structures when the 
domain of jobs is considered. The answer is 
that this is not true. The factor structures 
are surprisingly similar [p. 300].” It appears, 
therefore, from these two studies, that organi- 
zational level does not have a strong relation- 
ship to the structure of job attitudes. 

The salience of different features of jobs 
and jobholders appears to be related to the 
level at which a person is working in an 
organization. Two studies which demonstrate 
this are another article by Triandis and one 
by Pellegrin and Coates. Triandis (1959a) 
found that upper-level managers focused on 
such “expressive” categories of description as 
» and “polish,” while lower-level 


“graciousness 
managers and workers emphasized “instru- 
mental” categories such as “intelligence,” 


“skill,” and “dependability,” in describing 
different groups of employees. Likewise, he 
also found differences associated with the 
three levels of respondents when they were 
describing different jobs. The two managerial 
groups tended to use the criterion of the 
“nature of the work” to distinguish among 
different jobs, whereas the workers made their 
distinctions among jobs primarily on the basis 
of job requirements, particularly intelligence. 
Pellegrin and Coates (1957), in interviews 
witha sample 0 50 “top-level” executives and 
50 “first-level” supervisors, found that the 
ized the intrinsic 


former emphas!. features of 
their jobs whereas the latter viewed their jobs 


more as means to off-job rewards. This may 
indicate, in the terms of Herzberg, Mausner, 
and Synderman (1959), that managers in 
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TABLE 1 


STUDIES OF RELATIONSHIPS OF ORGANIZATIONAL LEVELS TO Jos Benavior 


N of N of Nof 
Investigators Type of behavior individuals levels companies 
Burns (1954) Interactions and activities 3 1 
Davis (1953) Communications (grapevine) 65 6 1 
Dubin & Spray (1964) Interactions and activities 8 2 3 
Martin (1959) Decision making ? 4 1 
Shartle (1956) Activities 14 2 y 


higher-level positions focus more on “moti- 
vating” aspects of their positions while super- 
visors in lower-level jobs are oriented more 
toward concern with the “hygienic” condi- 
tions of their positions. 

One other area of job attitudes in relation 
to organizational levels has received some at- 
tention, namely, types of leadership attitudes 
at different levels. The Ohio State studies 
(e.g., Fleishman, 1953; Fleishman, Harris, & 
Burtt, 1955) have demonstrated that attitudes 
about how supervisors should behave in rela- 
tion to their workers vary depending on the 
organizational level of the respondent. For 
example, Fleishman’s (1953) article showed 
that the higher the level of a manager, the less 
likely he was to believe that a supervisor 
should show “Consideration” attitudes toward 
his subordinates and the more likely he was 
to think a supervisor should put emphasis on 
“Initiating Structure” in dealing with subor- 
dinates. 


Behavior 


Studies of the relationships between or- 
ganizational levels and individuals’ behavior 
are relatively infrequent. The nature of five 
such studies that were located in the litera- 
ture, including the sizes of their samples, is 
indicated in Table 1. This table shows that 
the sampling, in terms of both individual sub- 
jects and organizations, is extremely limited. 
Any conclusions emerging from these studies, 
therefore, must be regarded with great cau- 
tion. One other point should be noted in Table 
1. These five studies dealing with behavior in 
relation to organizational level are not con- 
cerned with rates of production, absenteeism, 
safety, turnover, and the like. Instead, they 
concentrate on interpersonal interaction pat- 


terns, types of decisions, and types of func- 
tional activities performed by the individual 
in his job. The reason these studies do not 
focus on the types of behavior usually studied 
in the typical field investigation is that they 
are all concerned with differences in behavior 
between or among levels of management. 
Thus, output or production in the usual sense 
is not easily measured. Also, it is probably Bes 
sumed that turnover, absenteeism, and acci 
dent rates are so negligible at managerial 
levels that there would be little or no varia- 
tion in these indices from level to level. 

Keeping in mind the severely restricted 
nature of the samples in these five studies: 
there are several conclusions that seem to be 
supported by the data so far obtained: 

1. Informal Communication (Informa- 
tional) Patterns. Davis’ (1953) study of the 
grapevine operating in the management sec- 
tor of a particular company indicated that 
higher-level managers are much better in- 
formed on informal communication (as well 
as, presumably, on formal communication) 
items than lower-level managers and super- 
visors. Davis stressed the fact that, in this 
company at least, first-level supervisors were 
the most isolated from news traveling through 
the informal communication channels, 

2. Interpersonal Interaction Patterns. Three 
major types of interaction variables have bee? 
examined in several studies: (a) The ratio of 
upward (or superior) contacts to downward 
(or subordinate) contacts—the three studies 
(Burns, 1954; Dubin & Spray, 1964; Martin, 
1959) that deal with this variable present 
contradictory findings. Burns’ data (as pointe 
out by Dubin, 1962) show that the ratio 0 
superior to subordinate contacts decreases the 
higher the level of management; Dubin an! 
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anys data seem to indicate the opposite; 
Martin’s results show no changes in the ratio 
in relation to level. (b) Percentage of peer 
raters Burns’ and Martin’s findings 
oe that the relative number of contacts 
FETA DE decreases the higher the level of 
oe Dubin and Spray’s data are 1n- 
intacte « on this point. (c) Percentage of 
zational with individuals outside the organi- 
ba al unit (eg., factory, office, or com- 

Fd )—the data of both the Martin and the 
for = Spray studies show a definite trend 
the le neide contacts to increase the higher 
EHG of management, as one might expect. 
oes o; study indirectly supports this point, as 
the S Shartle (1956), who found that 
Sa S of seven organizations spent a 
ations proportion of their time on public re- 
k activities than did their immediate as- 


Sist; 
iate) a (d) Percentage of direct (face-to- 
of aa aeti Martin found the proportion 


tma aE contacts decreasing at higher 
Sement levels. 
only Types of Decisions. Martin’s study is the 
differen, of the five studies to focus On the 
i Sein fe in the types of decisions made at 
8ation evels of management. In his investi- 
own on four levels (from works manager 
h ht foreman) within a single plant, 
) that the higher the managerial level, 
(ig, © Steater the “duration” of the decision 
a he ee in time between the inception 
a tat process and the actual imple- 
Of its ¢ n of this decision and the verification 
Orrectness or incorrectness), and hence 
the ance the increase in the “riskiness” of 
decision ee and (b) the more abstract the 
ture» n Le., an increasing lack of &struc- 
: the data relevant to the decision). 
Ordi aa of Activities (e.g, planning, © 
ee Production, etc.). Data concerning 
o S in patterns of activities at different 
T i ations anans be are so meager that ger- 
> Sug rte cannot be made in this area, eve? 
t Spray ee studies (Burns, 1954; Dubin 
id Collect 64; Shartle, 1956) have attempted 
um Such information. 
mary 


Teyj 

s i 

trates 47 Of the relevant literature demon- 

at organizational levels are strongly 
oth attitudes and behavior. The 


evidence shows that perceived job and need 
satisfactions increase not only from rank-and- 
file positions to managerial positions, but also 
from lower management positions to middle- 
and upper-level positions. Thus, level seems 
clearly to be related to what individuals think 
they are receiving from their jobs. Level also 
seems to show a relationship to what employ- 
ees think they must put into their jobs, in 
terms of the behavior they perceive to be re- 
quired by their jobs. On the other hand, it 
should also be stressed that level does not 
necessarily affect all types of job attitudes. 
For example, it seemed to have only a slight 
relationship to the relative importance at- 
tached to different kinds of needs. 

In terms of behavior that is actually exhib- 
ited by employees, especially within mana- 
gerial ranks, level seems to affect the amount 
of information a person receives in his job, 
the types of interpersonal relationships he has 
on his job, and the types and nature of the 
decisions he must make in his position. How- 
ever, the evidence concerning these behavioral 
variables in relation to level is limited, and 
hence conclusions concerning them must be 
regarded as tentative at the present time. : 

The major research trend in this area in 
recent years, both in studies of attitudes and 
of behavior, seems to be a shift toward an 
increasing focus On the differentiation of 
organizational levels within menngemeni. S 
contrast with the former (almost comple e 
concentration on management versus nonman- 

ent differences. Eventually, it may be 
specks to describe organizational differences 
i major aspects of attitudes and aes ia 
at least four discrete points along the “4 

ical or level dimension of structure: He 

= ‘sons, first-line supervision, middle 
and-file postin Fe panagement So date; 
management, an ea ts of attitudinal 
however, We have no major sè 


or pehavioral data from a specific study that 
adequately S 


ample these four levels across 
different kinds of industrial or business or- 
i 
ganizations. 


Future re: 
need to focu 


search on organizational levels will 
s on the question of what it is 
t levels that seems to create dif- 
b behavior. In other 
el” is merely a con- 


“organizational lev 
? 
more fundamental, 


‘ont label for other, 
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variables that must be operating in the situ- 
ation. An understanding of the nature of these 
other variables and how they are distributed 
along the vertical dimension of structure is a 
research task for the future in this area. 
Future research will also need to focus on 
the necessity of disentangling the effects of 
individual-difference variables from the effects 
of organizational-level variables. Practically 
all of the studies so far carried out in this area 
have failed to consider the fact that individ- 
uals at different organizational levels may 
vary systematically in intelligence and/or 
personality traits. To what extent the obtained 
differences in attitudes and behavior are a 
function of these types of differences relative 
to differences due to job level is an open 
question. It is a question that will require 
sophisticated research for an answer. 


LINE AND STAFF HIERARCHIES 


The distinction between line and staff hier- 
archies is an old one in the study of formal 
organizations. Traditionally, positions con- 
cerned with the main operations of the organ- 
ization and within the direct chain of com- 
mand have been considered part of the line 
hierarchy, while positions concerned with 
auxiliary services and outside the direct chain 
of command have been considered part of the 
staff hierarchy. The staff’s function is to pro- 
vide specialized aid to the line, which is com- 
posed of “generalists.” Recently, this tradi- 
tional distinction has received attention from 
two sources. One group of writers (Brown, 
1954; Fisch, 1961; Leavitt & Whisler, 1958; 
McGregor, 1960) has contended that the 
distinction between these two parallel hier- 
archies is disappearing under the impact of 
modern technology. A second group of writers 
has dealt with the type of relationships that 
exist between line and staff managers. Both 
Dalton (1959) and Whyte (1961) have illus- 
trated the type of conflicts that can develop 
between managers in these two hierarchies ex- 
isting within organizations. 


Attitudes 


The evidence available concerning the 
amount of satisfaction provided by line and 
staff positions is consistent in showing line 
managers to be more satisfied than staff man- 


agers. Three studies by the Opinion Research 
Corporation (1959) reported that engineers 
and scientists in staff positions were less satis- 
fied with their jobs than engineers and scien- 
tists in line positions. In more recent studies 
using broader samples of managers, Rosen 
(1961a) and Porter (1963b) found that line 
managers reported a greater incidence of 
desirable conditions of work and perceived 
greater need satisfactions, especially in estee™ 
and self-actualization need areas. 

Three studies compared line and staff man- 
agers on the amount of importance they at 
tached to job-related factors and needs, and 
all three found essentially no differences be- 
tween the two groups of managers. Rosen an 
Weaver (1960) found that managers in line 
and staff positions did not differ substantially 
in their ratings of the importance of 24 desit 
able conditions of work, Porter (1963b) foun 
that there were no line-staff differences in thè 
importance attached to four of five nee 
areas; only on the need for autonomy did the 
two groups differ, with staff managers feeling 
this need was more important, Lawler 2? 
Porter (1963) found no differences betwee” 


the two types of managers in the importance 
attached to pay. 


are for success in their management positions: 
ff managers, sa 
and others might have pre 
dicted, felt they have to show more othe! 


directed behavior to succeed in their jobs tha? 
line managers, 


Behavior 


WA frequently cite, 
study by Dalton (1950) found that st@ 


managers in three plants had a turnover ratë 
between two and four times that of line ma™ 
agers. Davis (1953), ina study of one com 
pany, reported that staff managers were bette! 
informed than line managers. He attributé 

this superior knowledge on the part of st4 

managers to their greater mobility, 
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Staff executives in such areas as personnel and con- 
trol found that their duties both required and al- 
Toran them to get out of their offices, made it easy 
or them to walk through other departments without 
Someone wondering whether they were ‘not working,’ 
to get away for coffee, and so on—all of which 
meant they heard more news from the other exec- 
Utives they talked with [p. 47]. 


ed study (1954) also appears to support 
i notion of greater communication flexi- 
Hity for managers in staff positions. 


Summar y 


Ma the studies that have compared the 
Hija es and behavior of employees in line 
is elie Positions are all relatively recent, 1t 
Fiche to determine if the amount of 
ne eE between the two types of hierarchies 
Som fen decreasing over the long term, as 

. © Writers have suggested. The validity of 
lish ston probably will not be estab- 
7 until we have strictly comparative data 
iti differences from different time 
able S. In the meantime, however, the avail- 
conditi ne indicates that differences in 
are an ns between the two types of positions 
differen large enough to produce significant 
Staff nces in both attitudes and behavior. 
their jagers derive less satisfaction from 
irecteq S, feel they have to be more other- 
Avion, UNG exhibit different patterns of be- 
Consist he only area that failed to produce 
Ottance differences was the degree of im- 
tors ar attached to various job-related fac- 
tude w heeds. Apparently this type of atti- 
diti — little affected by organizational con- 

Surrounding line and staff jobs. 

differen tence on the attitude and behavior 
Positions > between managers in line and staff 
that „S Presents a picture of several factors 
Conflicts a responsible for line and staff 
Stance ` Lhe staff manager’s position, for in- 
about, Wires him to be knowledgeable 
time with specialty and spend considerable 
it Provig, a variety of types of executives; yet 
his n €s him with little opportunity to use 
Usup. Wedge for decision making unless he 


Manager ® authority of the line. The line 
him °°" S Position, on the other hand, places 
take „ > Situation in which he is expected to 


a . 
"esponsihine? on decisions for which he has 
Younger ility, from a man who frequently is 

and has been with the company a 


shorter period of time and who is not sup- 
posed to know the whole picture as he is. This 
is not a situation that lends itself to har- 
monious line-staff relations. The fact that 
managers in line and staff hierarchies see 
different traits being required for success and 
report different levels of satisfaction suggests 
that one way of easing some of the potential 
line-staff conflicts may be to take into account 
the psychological demands of the two different 
types of jobs in the selection and placement 
of managers. 


SPAN OF CONTROL 


Span of control is defined as the number of 
subordinates a manager is responsible for 
supervising. Since the early days of classical 
organization theory (Urwick, 1935), writers 
have prescribed what they felt to be an opti- 
mum span of control. In most instances, the 
classical theorists have assumed that a small 
span of control (few subordinates) is a “good 
thing” for any organization. Typical pre- 
scriptions have recommended spans of con- 
trol between three and six (Dale, 1952). 
However, classical organization theorists gen- 
erally have failed to consider what factors 
might call for a change in the size of the 
span of control. Recognizing this, Fisch 
(1963) has recently argued that such factors 
as level of management, company product 
diversification, and the personalities of the 
managers should be considered in determining 
the optimum span of control. i 

A vigorous dissent from the classical ap- 
proach to the span of control was made by 
Worthy (1950). His radical suggestion was 


that a large span of control is good since it 


provides better communication and greater 


ities for individual growth and initia- 
Opeth this statement by Worthy, the bat- 
tle line between the two approaches to organi- 
zation theory was clearly ee 
researchers have not been quick to oa n e 
fray, and as late as 1960 Pfiffner an e 
vond stated: “There has been relatively lit- 
tle, if any, empirical study of span of control 
[p. 156].” The majority of the research that 
has been done on the topic has been concerned 
with discovering variables that are related to 
the size of the span of control found in exist- 


ing organizations. Entwisle and Walton 
5 
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(1961), for example, found that the size of the 
organization and the function of the organiza- 
tion were both related to existing spans of con- 
trol. This type of research undoubtedly repre- 
sents a necessary first step in determining 
which conditions favor a large span of con- 
trol and which favor a small span of control, 
but it does not provide data on relationships 
to job attitudes or performance. In fact, only 
one such study was located in the literature. 


Attitudes 


No empirical studies were found that at- 
tempted to determine the relationship between 
span of control and employee job attitudes. 
Worthy (1950) did report that in one com- 
pany a large span of control had been found 
to be positively related to high morale; how- 
ever, he presented no published data to sup- 
port this point, and thus it is difficult to eval- 
uate his claims, 


Behavior 


A study by Woodward (1958) found a 
relationship between the size of the span of 
control at the first level of supervision and 
company performance when several English 
companies were viewed. The companies were 
divided into three groups based upon the 
method of production used—unit (e.g., pro- 
duction of unique units to customers’ orders), 
mass (e.g., production of large numbers of 
identical units by assembly-line techniques), 
and process (e.g., production of chemicals, etc., 
by continuous processing). For each of the 
three types of production, the most successful 
companies were those that were near the 
median span of control for that production 
type. These median spans of control (23 for 
mass, 49 for process, and 13 for unit) for the 
successful companies were substantially larger 
than those suggested as optimum by the class- 
ical theorists. Woodward’s study thus gives at 
least tentative support to Worthy’s point that 
a large span of control can produce high per- 
formance. However, it is important to re- 
member that Woodward’s study was con- 
cerned with the span of control of first-line 
supervisors, and although a relatively large 
span of control may have proved to be opti- 
mum at this level, there is no reason to believe 
the same necessarily will be true of the upper 
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levels of management. Perhaps the most sig: 
nificant point about the Woodward study § 
that the method of production was found to 
be an important variable in determining which 
span of control at the first level of super 
vision was optimum for producing high per 
formance. 


Summary 


The lack of research makes it impossible 
to state that any particular span of control 35 
best for producing high performance or pos“ 
tive job attitudes. What data there are SUS 
gest that it is not reasonable to expect any one 
span of control to be ideal for all organiz® | 
tional situations. Level of management an” | 
type of production appear to be two variables 
that influence the optimum span of contr 
for a given situation, and undoubtedly furth? 
research will uncover other variables that 4 
important. 


Size: SUBUNITS 


For the purposes of this review, any grou 
ing of the members of a business organizati? 
that systematically excludes part of the mer 
bership of that organization is considered * 
organization subunit. Primary work grouk 
departments, and factories (in multifacto 4 
companies) have been frequently stu! “él 
organization subunits. Previous summa ig 
of the research evidence on the relations, 
between organization subunit size an y 
attitudes and performance typically ba 
concluded that small organization subU $ 
are superior on all counts (e.g., Strauss ce 
Sayles, 1960; Viteles, 1953), Morale, abs°™,, 
rates, turnover rates, accident rates, and Pn 
ductivity are all considered to be bette! 
small than in large organization subunits: “y 
example of the broad statements supp "i 
small organization subunits is Viteles’ o 
clusion that “The size of the work go'y 
affects output and attitudes, which both te 
to be better in smaller sized groups [p. 14 fi 

Recently several articles (Herbst, 19 0” 
Indik & Seashore, 1961) have contained thts 
ries attempting to explain the beneficial effei 
of small organization subunits. Undoubt® y 
the appearance of these theories is due t° if 
aforementioned widespread acceptance 0 ww 
belief that the experimental evidence © 
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TABLE 2 


EEN ORGANIZATION SUBUNIT SIZE AND JOB ATTITUDES AND JOB BEHAVIOR 


W 
an 


Atti aat 4 
‘ S eee i Investigators Type of subunit considered e 
Job satisfaction Talacchi (1960) Factories Negative 

Kerr, Koppelmeier, & Departments Negative 
Sullivan (1951) . i 
Indik & Seashore (1961) Departments Negative 
Katzell, Barrett, & Parker Departments Negative 
(1961) — 
Campbell (1952) Work groups Negative 
Worthy (1950) ; , Negative 
Indik & Seashore (1961) Automobile dealerships Zero 
Absenteeism Revans (1958) Gas works Positive 
Revens (1958) Factories Positive 
Revans (1958) Factories Positive 
Acton Society Trust (1953) Factory , : Positive 
Baumgartel & Sobol (1959) “Plants” (airline locations) Positive 
Research Council for Economic Plants Positive (?) 
Security (Baumgartel & 
Sobol, 1959) E 
Hewitt & Parfitt (1953) Departments Paie 
Indik & Seahore (1961) Departments Posve 
Kerr, Koppelmeier, & Sullivan | Departments ositive 
5 aji: 
e & Mann (1953) Work groups (blue-collar) Positive 
Argyle, Gardner, & Cioffi Work groups Curvilinear 
Me & Mann (1953) Work groups (white-collar) Zero 
i ail] a - — 
Turnover i 61 Automobile dealerships Positive 
k & Seashore (1961) iti 
a Koppelmeier, & Sullivan | Departments Positive 
1951 ayy 
sanded (1956) Departments a 
Argyle, Gardner, & Cioffi Work groups 
a | (1958) <a 
Acci 5 Mines (Britain) ositive 
e ihe me Factories (Britain) Positive 
Revans (1958) Departments (Asia) Positive 
Reva (1988) Mines (U. S. A.) ae 
ae Department of Labor Factories (U. S. A.) Curvilinear 
“(Revans, 1958) r Negative 
Pte, Safety Council Factories (U. S. A.) g: 
58 - 
Lab A Casc arn Factories Positive 
abor disputes Cleland (1955) eines Positive 
j POE o 
>~ [Revans (1958) ae Sante 
P iyi ny divisions 
roductivity Katzell, Barrett, & Parker Company es 
(1961) ents egative 
Indik & Seashore (1961) es roe Negative 
Marriott (1949) bali Curvilinear 
Revans (1958) eG stort Curvilinear 
Revans (1958) esi poeta Curvilinear 
Pere carta (1961) Automobile dealerships a 
Argyle, Gardner, & Cioffi Work groups 
(1958) 
Telati A Positive y y ior to become more frequent as size increases, A negative 
i l ip indi attitude or behavior i S A curvilinear relationship indl- 
atesa D indicates eer indicates a trend ot ior ie neon less fredi nt as size Inet attitude or behavior. 


‘or the middle-sised subunit to exhibit the greatest or the 
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clusively supports the proposition that smaller 
organization subunits are always beneficial. 
Virtually ignored in the rush to accept this 
simple picture of the effects of subunit size 
and completely ignored in theoretical at- 
tempts to explain this picture has been the 
effect of two other variables: job level within 
the organization subunit, and type of organi- 
zation subunit (e.g., work group, department, 
factory). These variables may well function 
to limit the degree to which it is possible to 
generalize about the beneficial effects of small 
organization subunits. For example, increasing 
the size of a department may raise the absence 
rate among the department’s production work- 
ers, but it is doubtful that it will also increase 
the absence rate of the department’s manage- 
ment force. Likewise, increasing the size of 
a primary work group may cause lower morale 
in that work group, but increasing the size 
of a factory by adding more small work 
groups may very well not decrease the morale 
of the total plant. Thus, before we accept 
the conclusion that all types of large sub- 
units, regardless of job level being considered, 
are always inferior to small ones, it may be 
well to see if the evidence justifies such a 
conclusion. 

Since there are a relatively large number of 
studies that have investigated the relationship 
of subunit size to both attitudes and behavior, 
Table 2 was constructed to summarize most 
(but not all) of these studies. This table 
will serve as the basis for most of the 


subsequent discussion in this section on 
subunit size. 


Attitudes 


Job Satisfaction and Morale. As Table 2 
indicates, there is little doubt that subunit 
size is significantly related to differences in 
job attitudes. The evidence is strong that 
workers in small departments and work 
groups are better satisfied than workers in 
large departments or groups, as six of the 
seven studies reviewed show results in this 
direction. However, the evidence does not yet 
support the sweeping conclusion that all types 
of employees in smaller organization sub- 
units are more satisfied than those in larger 
subunits, since no studies were located, for 
example, that compared managerial job atti- 


tudes in different size organization subunits 
Thus, it is impossible to reach any conclusion ( 
about the relationship between subunit size 
and the job satisfaction of managers. 
Talacchi (1960) studied the level of job 
satisfaction among employees in 41 plants 
that were subunits of five larger organizations 
His results showed a strong trend (7’s ranging 
from —.42 to —.81) for job satisfaction 
decrease as plant size increased. Kerr, Kop 
pelmeier, and Sullivan (1951) found that for 
894 workers in 29 departments in two elec 
tronics plants job satisfaction was correlate 
—.46 with size of department. Indik 4" ) 
Seashore (1961) measured the amount © 
intrinsic job satisfaction expressed by worker 
in 32 package delivery departments, vary” 
in size from 15 to 61, and found that BF 
creased size led to lower satisfaction. L* 
wise, Katzell, Barrett, and Parker (196! 
found a significant trend for workers in Jarg 
warehouses to express lower job satisfactii 
than workers in small warehouses (st 
23 out of 23 correlations being in rs, 
direction). Campbell (1952), in study"? 
workers’ satisfaction with incentive if 
plans as a function of the size ( “a 
under 20 to over 100) of the work grouP i 
which they belonged, found that workers y} 
the smaller groups felt they had better kno r 
edge of how their pay plan operated and W by 
more satisfied with their pay. Finally, Wort 
(1950) reported: “Our researches dem 
Strate that mere size is unquestionably, re 
of the most important factors in determi g 
the quality of employee relationships: pi 
smaller the unit the higher the morale, yi 
vice versa [pp. 172-173].” Regrett@ te 
Worthy failed to document this strong Sa ú 
ment with any published data on the sie g 
the units studied, the type of employ 
studied, or the measure of morale used. gi 
- The only study that failed to show 4 we 
tive relationship between subunit size a” dy 
satisfaction was one reported in the previo g 
mentioned publication by Indik and Se Si 
(1961) which concerned employee job sa 
faction in automobile dealerships. Since Pal 
dealerships were relatively autonomous: pe 
perhaps not completely accurately cla y 
as subunits, the failure to find a strong ”~ 
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tive relationship in this particular instance 
is probably not too significant. 
_ Other Attitudes. Three studies (not shown 
in Table 2) investigated the relationship be- 
tween organization subunit size and job atti- 
ae other than those concerned with satis- 
action. Hemphill (1950) found that as group 
or increased, members expressed attitudes 
i. tolerance for leader-centered direc- 
(1956 group activities. In addition, Hemphill 
ch ) found that small groups were better 
haracterized by attitudes of high intimacy, 
pa hedonic tone, and low control than were 
wie groups. It should be noted that both 
Yet ye Hemphill employed subjects from 
indy i lverse types of groups (ranging from 
a work groups to - members of a 
thei y), and thus it is difficult to assess 
Ir relevance for industrial organizations. 
ne (1954) studied the relationship 
i group cohesiveness and size of work 
aish for more than 200 groups In a large 
Bener Hi organization. The results showed a 
deiren trend for feelings of cohesiveness to 
ever ase with increasing subunit size. How- 
an a was also a tendency for the 
“Nest size groups to fall at the two extremes 
erta Ssiveness; that is, they were either 
ness n high or extremely low in cohesive- 
ai evertheless, Seashore’s general con- 
related was that group size is negatively 
to group cohesiveness attitudes. 


Behavior 
ene tntecism. The greatest amount of evi- 
Organiz emonstrating a relationship between 
ears sa subunit size and job behavior ap- 
in Tabh the topic of absence rates. As showr 
foung s 2, 10 of the 12 studies reviewed 
Seng Positive linear relationships between 
Andin rates and the size of subunits. This 
8 appeared for factories, departments. 
fin primary work groups. Again, however, the 
rien is limited to absence rates among em- 
Stuq a the blue-collar level, since the only 
Workerg at separately looked at white-collar 
tween absences found no relationship 
aran kinif size and absence rates. 
"view of (1958) has provided an excellent 
SUbunj of much of the data on the topic of 
re ee size in relation to absenteeism. He 
ed that in five randomly chosen gas 


w 
s$ 


works,” ranging in size from 67 to 3,430 
employees, the duration of absence due to 
accidents and the log of the size of the works 
were correlated +.91. In addition, size of 
works was correlated +.62 with absence rates 
due to sickness and other factors. Revans also 
reported two further studies dealing with size 
of factory in multifactory organizations. Each 
study found significant positive correlations 
between log size of factory and absence rates. 
A study by the Acton Society Trust (1953) 
reported a correlation of +.44 between size 
of factory (ranging from under 100 employees 
to more than 1,000) and absence rates. 
Baumgartel and Sobol (1959) did a well- 
controlled study on the relationships between 
the size of work locations or “plants” of an 
airline and absenteeism and found a strong 
tendency for absenteeism to increase with size 
of the work location. These investigators also 
cited the report on Prolonged Illness Ab- 
senteeism rates. And Metzner and Mann 
for Economic Security, which showed that 
smaller-sized plants had lower prolonged ill- 
ness rates than medium-sized plants. How- 
ever, the same report indicated that there 
was not a consistent linear relationship be- 
tween plant size and illness. Several other 
studies, Hewitt and Parfitt (1953), Indik and 
Seashore (1961), and Kerr, Koppelmeier, and 
Sullivan (1951), have all found that larger 
departments of a company have higher ab- 
senteeism rates. And Metzner and Mann 
(1953) found that among blue-collar workers 
small work groups (seven and under) had 
fewer absences than did large work groups 
(over seven). 

Only 2 of the 12 studies dealing with sub- 
unit size and absenteeism failed to show posi- 
tive relationships between these two variables. 
Argyle, Gardner, and Cioffi (1958) reported 
that they found a curvilinear relationship 
between absence rates and work-group size 
(ranging from 1 to 20+), with the lowest 

2Much of the literature on the ee ies 


vior and size fails to i 
Lege page (eg. gas works, factories, Or 


s of larger organizations or are sepa- 


part: i F 
rate corporations. For the purposes of this review, 


+. was decided that unless the author specified he was 
z ken with separate companies or firms, all data on 
ene gas works, and mines would be dealt with 
a , 


as data on organization subunits. 


mines) are 
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absence rates occurring in the middle-sized 
groups. Metzner and Mann, in their above- 
cited study (1953), found an insignificant 
relationship, although in the expected direc- 
tion, between absence rates and work-group 
size for white collar workers. (It should also 
be noted that in the previously cited Baum- 
gartel and Sobol study of airline locations, 
the effects of location size were less for white- 
collar workers than for blue-collar workers.) 
Turnover. Table 2 shows that three of four 
studies found that turnover is greater in large 
units compared to small units. However, none 
of the studies compared the turnover rates 
of different-sized factories, and none of the 
studies considered turnover rates among 
managerial personnel. Indik and Seashore 
(1961) found that more employees were 
terminated in large than in small automobile 
dealerships. Kerr, Koppelmeier, and Sul- 
livan (1951) found a significant relation- 
ship between turnover and department size 
(r= +.49). Likewise, Mandell (1956), ina 
study of 320 companies, found that large of- 
fices tended to have higher turnover rates 
than did small offices. On the other hand, 
Argyle, Gardner, and Cioffi (1958) did not 
find any relationship between subunit size 
and turnover. 
Accident Rates. Revans (1958) has thor- 
oughly reviewed the literature on the rela- 
tionship between accident rates and organiza- 
tion subunit size. As the six sets of data 
covered in his review indicate, there is not 
a consistent pattern of relationship between 
these two variables. In fact, three different 
patterns were found: positive, curvilinear, and 
negative relationships. A review of the evi- 
dence cited by Revans leads to the conclusion 
that accident rates may be dependent upon 
the type of industry considered and upon the 
technological development of the country in 
which a study is carried out. At the present 
time it seems wise to speak only of the type 
of relationship that exists for a given industry 
in a given country, and not for industry in 
general. For example, as shown in Table 2, 
the evidence is fairly convincing that for 
United States industry large size is not asso- 
ciated with high accident rates, while for 
British industry large size does appear to be 
associated with increased accident rates. 
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Revans’ analysis of the accident rate for! 
British coal miners and quarriers in 1950 
showed a strong positive relationship between 
mine size and number of accidents. Lakes 
the Annual Report of the Chief Inspector H 
Factories for 1956 (cited by Revans) showé 
that the compensable accident rate rises stead 
ily with the average size of factory for Bat 
industrial firms. Also reported by Revans a 
a strong positive correlation (+.47) betwee! 
accident rates and department size for ste 
companies operating in Asia. | 

Turning to the data for American indust) \ 
Revans cited three studies that indicate 
either a curvilinear or negative relationship 
between subunit size and accident rates. ; i 
data for American mines, in contrast to eet 
lar data for mines in Britain, showed the ma 
est accident rates occurring in middle-s!” A 
mines employing 100 to 300 workers. e 
for United States factories were provided f 
two studies which showed somewhat aiies | 
results from each other. Bulletin 1164 of by | 
United States Department of Labor (cited a 
Revans) on “Work Injuries in the Uni $ | 
States during 1952” points to a curvilin® 
relationship in the following quotation: 


The larger establishments, which can afford train | 
safety engineers and which conduct intensive Saul | 
Programs, generally have the lowest rates. USU d 
the medium-size plants have the highest rates |. 
the smallest establishments show rates somewhat ag? 
low the medium-size plants but above the aveg) 
for the industry [quoted by Revans, 1958, P- A 
In a report from the National Safety Count | 
on 3,500 United States manufacturing pa . 
Revans cited data showing a strongly dec 
ing accident rate with increasing plant 5 v0 
Labor Disputes. Data contained in t 

sources (Cleland, 1955; Revans, 1958) Dae 
a positive relationship between subunit si 
and labor disputes. Cleland’s data dem? 
strated that as plant size increases, the p", 
ability of an industrial strike rises sig” K 
cantly. Similarly, Revans noted that Britt 
coal miners in larger mines were "i 
Sensitive to nonmonetary issues (although E 
to monetary issues) than were miners 0 
smaller locations. On the basis of these * y 
reports it is tempting to conclude that sms 
subunits are more likely to have fewer 14 e 
problems than larger subunits. However, 
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must be cautious in making this interpreta- 
tion, since it is not size of operation, as such, 
that affects the number of labor disputes, but 
rather the type of technology that is associ- 
ated with size (see Summary at the end of 
this section). y 
a ecient As Table 2 shows, the studies 
Ee relationship between performance and 
eaan subunit size do not present a 
does ut picture. Thus, the available evidence 
tion ree support the widely believed assump- 
ie that small subunits are superior to large 
S In terms of job performance. 
alle studies found better performance in 
exampl. subunits than in larger units. For 
Shad, a Katzell, Barrett, and Parker (1961) 
Ptofitabine product value, productivity, and 
ivisi lity were all lower in the large 
cus of the company they studied. Indik 
lation am (1961) found a negative corre- 
tions a etween the size of delivery organiza- 
Units $ rated overall performance of these 
study of arriott (1949), in a widely cited 
Size ang the relationship between work-group 
Ories Production in two automobile fac- 
but 2 voncinded from his data that: “Low, 
Which ce correlations were obtained 
etween emonstrate an inverse relationship 
Btoups Output and size, the smaller sized 
ach fa Showing consistently larger output 1m 
ctory [p. 56].” 
eo Contrast to the results from the above 
Other Studies, were the findings from five 
studies and reports; three of them 
i sia Vilinear relationships between sub- 
no ta production, one found essentially 
Ship ionship, and one a positive relation- 
evans (1958) reported that data 


Bathe t 
Britain by the National Coal Board in 


1509 t tained in middle-sized mines where 
Mosg) 2,000 men were employed. Revans 
n sale and Herbst (1957) both present data 
A ilins of retail sales units, and both report 
degize T relationships to size, with the mid- 
anq oo Units again performing best. Indik 
rDo a (1961), on the other hand, 
tween © a lack of significant relationship be- 
Perfon 2° Of automobile dealerships and job 

ance, and Argyle, Gardner, and Cioffi 


(1958) found a slight tendency for larger 
work groups to achieve higher productivity 
than smaller groups. 


Summary 


The literature on subunit size shows that 
when blue-collar workers are considered, small 
size subunits are characterized by higher job 
satisfaction, lower absence rates, lower turn- 
over rates, and fewer labor disputes. The 
evidence does not show, however, a consistent 
relationship for blue-collar workers between 
accident rates and subunit size, or between 
productivity and subunit size. Furthermore, 
none of the studies reviewed investigated the 
relationship between managers’ job attitudes 
and job behavior and organization subunit 
size. 

Future research might well consider the 
problem of assessing the relative impact on 
attitudes and behavior of the size of the 
different organization subunits of which an 
individual is a member. A typical worker is a 
member of at least three organization sub- 
units: a primary work group, a department, 
and a factory or office. Although the evidence 
indicates that large-sized subunits are associ- 
ated, for example, with high absenteeism, it 
is impossible to know on the basis of past 
studies if it is large size in the primary work 
group, department, or factory that is crucial 
in this relationship. Previous investigators 
have failed to control for variation in the size 
of the other types of subunits of which an 
individual is a member while they are study- 
ing the effects of size of one type of unit. 
For example, no study has compared ab- 
senteeism among work groups that are part 
of the same size departments and factories. 
Similarly, we do not know if factory size is 
related to absenteeism when all factories con- 
sidered have the same size departments and 
well be that absenteeism 


rk groups. It may a 
 aienaitd by work-group size but because 


o rk groups are found in large fac- 
oe mn departments a spurious a 
exists between absenteeism and ; oe $a : 
size and between eS aa ee oy ay 

the present time, f . 
Pe sea suggests that aS eD 
units are desirable for a number = r a 
Although small-sized units may not imp 
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Large | Low Cohesiveness; 


Subunit |——| High Task Specialization; }——> 


Poor Communication 
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High 
Job 
Dissatisfaction 


High Turnover; 
—>| High Absenteeism; 
Labor Strife 


Fic. 1. Hypothesized effects of subunit size. 


job performance directly, the fact that they 
are associated with low turnover and low ab- 
senteeism argues strongly for their value. The 
question then arises as to why small subunits 
seem to have certain advantages over large 
units in affecting attitudes and behavior. 
Figure 1 presents a diagram that offers a 
possible explanatory model for the effects of 
organization subunit size on satisfaction, 
turnover, absenteeism, and labor strife. As 
subunits increase in size, it is probably in- 
creasingly difficult to maintain high cohesive- 
ness and good communications. Further, task 
specialization may be more prevalent. The 
factors of low cohesiveness, high task speciali- 
zation, and increased communication difficul- 
ties might tend in turn to lead to high job 
dissatisfaction. As Brayfield and Crockett 
(1955), among others, have Pointed out, there 
is good reason to believe that dissatisfaction 
leads to higher turnover and absenteeism rates. 
Also, labor strife logically would seem to be 
affected by increases in job dissatisfaction. 

On the basis of the diagram presented in 
Figure 1, there is no necessary reason for 
large units to lead directly to decreases in 
productivity. Although the model does predict 
that large units will be characterized by, 
among other things, higher job dissatisfaction, 
reviews of the literature have not found that 
job dissatisfaction is associated with low pro- 
ductivity. Nor is there 


ivity, evidence that task 
specialization, low cohesiveness, and com- 


munications problems must necessarily lead 
to lower productivity. It is hot surprising, 
therefore, that the evidence on the relation- 


ship between productivity and subunit size is 
inconsistent. 


Size: TOTAL ORGANIZATIONS 


All of the studies discussed in the previous 
section were concerned with the relationships 
between size of some type of organizational 
subunit and attitude and behavioral indices. 
In the present section we will review only 


those studies that have dealt specifically vi 
the total organization as the unit of size. 
will become evident that the number of oa 
studies on total organization size is es 
lower than the number of studies on subuh! 
size? ae. 
The distinction between subunit size m 
total-organization size has seldom been m 
clear either in previous empirical studies ie 
ing with size or in various interpretatio 
of these studies presented in textbooks ae 
cerned with organizational behavior. It is a 
ceivable, for example, that although worki 
in a large subunit has disadvantages i 
demonstrated by the evidence in the precede 
section), working in a large total organizati 
might have advantages as long as the subur f 
within the organization are relatively SM he 
Therefore, it seems necessary to keep or 
subunit/total-organization distinction cle@ a 
in focus because the effects of one tyP? e 
size (e.g., total organizational size) maY 
confounded by the effects of the other al 


of size (ie. size of subunits within 
organizations). 


P: 

. . 5 
The absence, in most previous studies ad 
Size, of a distinction between subunits by 


total organizations has been compounde evtl 


the failure to take into account the job the 
of the employees 


i Dy 
) - As was pointed out ™ gy 
Previous section 


3 , organizational level 
Interact with size 


taining to lower 
up when appli 


y 
sang pe 
such that conclusion’ rol 
-level employees may mont 
ed to higher-level pers 

to 
* By the term “total Organization” we mea i H 
operating company headed by an executive e ii, 
title “President.” Tt is admittedly difficult at jo" 
to determine whether a “company,” in t epai 
sense of the word, should be considered 4 S 
total Organization, in our terms, or merely 4 ji if f 
of an even larger “corporation.” In genera py 
company has a chief executive with the title 0° on” 
ident and if that company can sell stock in i 
ently of other “companies” all under the same 


tate holding entity, we would consider it 
Organization. 


4 
conte 
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This possible interaction effect of level could 
presumably influence results both for studies 
of subunit size and for studies of total- 
organization size. With the foregoing con- 
Pp oe in mind, we now turn to a review 

| the few studies that have been concerned 
with total-organization size. 


Attitudes 


- E rank-and-file worker level, two 
seem ave appeared in the literature that 
of al be concerned with the relationship 
unit si -organization size (rather than sub- 
two a to job attitudes. The first of these 
sampli icles was by Benge (1944). Apparently 
es ok from a number of different compa- 
tien ee “composite attitude of 
as an yees in each company toward the boss” 
of roc of morale, he found that “morale 
abl Ployees of small companies is appreci- 
s better than that in larger companies 
ae ae These results are extremely dif- 
number. evaluate, however, since neither the 
er of or type of respondents nor the num- 
ased oro mpanies on which the results were 

hi was specified in the article. 
with Pane study that seems to be concerned 
o fifty relationship of total-organization size 
One p attitudes of rank-and-file workers Is 
is ee by Talacchi (1960). The study 
Sect as an investigation involving 93 
(Tal le Atots however, as one of the tables 
east rt in the article, p. 411) indicates, at 
“Dlant > of the 93 organizations are in reality 
copan; that comprise parts of only five 
Otted a Thus, most of the findings re- 
if one in this article are difficult to interpret 
Organiz sd interested in the effects of total- 
tis aa size, since the distinction between 
Comp] S and total organizations has been 
th rigs blurred. Nevertheless, it appears 
ating. article does present some evidence 
Organi onthe effects of size of the total 
e€ ke lon on employee job satisfaction. 
Was a Y result reported in Talacchi’s article 
tio oe of —.67 between organiza- 
Satisfacti and the overall index of employee 
Ployeg S as measured by the SRA Em- 
as; ine a iN: Employee dissatisfaction 
ganizat; ore, apparently greater within large 
Mum siz ions (which ranged up to a maxi- 
€ of 1,800 employees). Aside from the 


limitations in the study that have already 
been pointed out, this finding must be inter- 
preted cautiously because it is possible that 
the larger organizations contained larger sub- 
unit work groups. In other words, it is pos- 
sible that what seemed to be an effect of 
organization or, at least, plant size may have 
been due to work-group size. 

At the management level of organizations, 
several recent articles by Porter (1963c, 
1963d, 1964) examined the relationship be- 
tween three types of job attitudes and the 
size of company for which each manager 
worked.* In this questionnaire study of more 
than 1,500 managers, respondents were placed 
into one of three size-of-company categories: 
small (companies of under 500 employees), 
medium (500-4,900 employees), and large 
(more than 5,000 employees). One part of 
the data dealt with need satisfactions and 
indicated that the effects of size were modi- 
fied by the level of management being con- 
sidered. At the lower and lower-middle 
management levels, managers from smaller 
companies were more satisfied than those from 
larger companies. At the upper-middle and 
vice-president levels, the reverse was true, 
with the managers from the larger companies 
indicating greater satisfaction. With respect 
to a different area of perceptions, when re- 
spondents were asked to rank 10 personality 
traits in terms of their importance for job 
success, it was found that managers from 
larger companies placed more emphasis than 
did managers from smaller companies on 
innerdirected or non-organization-man traits 
compared to otherdirected or organization- 
man traits. When respondents (across the 
intermediate three levels of management) 
were asked to use semantic differential scales 


ed out that Porter also did not 
control for size of work groups or subunits in which 
an individual manager worked within his company. 
However, it is assumed that the concept of work 

—nu dividuals reporting to a single 
gront _—js a less meaningful and important var- 
supervisor because of the nature of 
it is probably not as 
n assessing the effects 
being considered are 
it is possible that this varia- 
ffect on Porter’s 


4 Tt should be point 


necessary 
of total size W: ‘hs 

Jevertheless, 
managers. Nev y 
ble did have some undetermined € 


findings. 
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to describe their jobs, managers from larger 
companies were slightly more likely to apply 
the terms “challenging,” “interesting,” and 
“competitive” to their jobs than were small 
company managers. (For both the personality 
trait and job-description results, management 
level did not seem to affect the findings. In 
other words, the effects of size were fairly 
consistent from one management level to the 
next.) These latter findings were interpreted 
as indicating that large companies place no 
more emphasis, perhaps even less, on con- 
forming, organization-man type of behavior 
than do smaller companies. 


Behavior 


With the exception of one quite limited 
area of investigation, there are almost no 
studies that specifically compare large total 
organizations or companies to small total 
organizations on behavioral measures other 
than attitude data.’ Undoubtedly the major 
reason for this lies in the difficulty of obtain- 
ing a sample of organizations (as compared 
to a sample of individuals or a sample of 
subunits within organizations) in sufficient 
quantity and with sufficient variation in 
total size, 

The one type of behavior that has been 
investigated in relation to total-organization 
size concerns the rate of managerial succession 
into top-level jobs. The first such study, by 
Grusky (1961), utilized Fortune’s annual list 
of the 500 largest United States companies to 
select two groups of organizations differing on 
total size. His results indicated that the fre- 
quency or rate of succession was positively re- 
lated to size such that there was more rapid 
turnover in the uppermost management posi- 


5The previously discussed article by Talacchi 
(1960) has sometimes been cited as an article that 
presents data on the relationship of “organization” 
size to job behavior, such as turnover rates and 
absenteeism rates, but no such data are given in the 
article. Instead, the findings are presented in terms of 
the relationship of size to satisfaction (as already 
noted above) and the relationship of satisfaction to 
turnover and absenteeism. Not only are there no 
data presented on the direct relationship of size to 
turnover and absenteeism, but also the data that are 
presented concerning these latter two variables in- 
volve intraorganization rather than interorganization 
comparisons. Thus, this article does not provide be- 
havioral data in relation to size of total organization. 
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tions within larger-sized companies. A study 
by Kriesberg (1962) seemed to confirm Gri 
sky’s findings. However, a later study by Gor 
don and Becker (1964), which involved a a 
analysis of the data used by Grusky, show 
very little relationship between size and i 
of managerial succession. Thus, the possible 
effects of size on this particular variable y 
organizational behavior remain obscure ? 
this point. 


Summary 


A ; ; that 
A review of relevant journals shows i A 
very few studies have been carried ou 


à H A ization 
the relationship of size of total organizat ‘|| 


to either job attitudes or job behavior. TH 
studies of worker attitudes that seem a 
in this category of investigations indica of 
that job satisfaction and morale were i 
in larger organizations. However, both stu ait 
were reported in such a way that it 15 K 
ficult to draw accurate or meaningful a 
clusions concerning the effects of organizat! © 
size. At the managerial level, Porter (19 fot 
1963d, 1964) found no overall advantage ; 
either large or small organizations in relat 
to need satisfactions; he did find, howe d 
that managerial jobs in larger companies W js 
seen as requiring a somewhat greater emph? 
on inner-directed behavior and as having i 
slightly greater amount of challenge and get 
est when compared to similar managerial J 
in smaller companies. jaa" 
Overall, the findings relating total-org4” as 
tion size to job attitudes do not presen al 
clear a picture as is the case for findings 4 Je 
ing with subunit size, It is entirely possi e 
that the negative effects of large subunit ê 9 
on job attitudes may not extend to a 
total-organization size. As has been sugges 


An increase in the total size of an organization" e 
the consequent technological advantages of anes 
scale operation—will not necessarily reduce the g5 
rale and job satisfaction of employees as 10 fieh 


intra-organization work units are kept small Po! 
1963d, p. 61]. 


This is a hypothesis that could be iit 
easily confirmed or disconfirmed by of 
studies. Further research on total-organi24 0% 
size should also take into account the 
sibility that size of the total units may 


nh 
one kind of effect on attitudes at the '@ 


— 
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and-file level, but a reversed effect at mana- 
serial levels, especially the higher levels. 
Again, this possibility would not be too 
difficult to test with additional research. 

In the area of behavior, there is almost a 
complete lack of information on the effects 
of total-organization size. It would seem that 
future studies on size could, among other 
a attempt to obtain measures of absen- 

sm, turnover, and similar rates in relation 

to total-unit size just as past studies have 
er this kind of data in relation to sub- 
size. At the moment, at least in the 
al see literature, we do not even know 
Oils E the United States Steels, the Standard 
PSR e General Motors, etc., have higher or 
Sat ee or absenteeism rates through- 
eir total organizations than do the local 

Pe or Ace manufacturing companies. 
be ge it must be emphasized that if such 
ior of total organizations are to be 
sizes igh, the effects of differences in the 
fine various subunits among the organiza- 
Civ be partialed out and job level 
will ty isa Perhaps total-organization size 
a a ia out to be an irrelevant variable in 
it wo such behavioral indices. At any rate, 
his ould seem worthwhile to know whether 

1S the case or not. 


SuHare; TALL OR FLAT 


e and flat organization structures are 
hase y distinguished on the basis of the 
to ¢ hy of levels in the organization relative 
Organi total size of the organization. A flat 
oe structure is one where there are 
organis, els relative to the total size of the 
is on ation and a tall organization structure 
© where there are many levels relative 

y Aa sie of the organization. Another 
ow Pie | this is to say that the degree 
Mined p, Structure is tall or flat is deter- 
the org y the average span of control within 

tt ganization. 

merits ntion has been focused on the relative 
l of these two types of structures since 
ed when Worthy published his widely 
E: p article on “Organizational Structure and 
po ae Morale” (see Gardner & Moore, 
(1o50 iteles, 1953; Whyte, 1961). Worthy’s 
less com asic Conclusion was that: “Flatter, 
mplex structures, with a maximum of 


administrative decentralization, tend to create 
a potential for improved attitudes, more ef- 
fective supervision, and greater individual re- 
sponsibility and initiative among employes 
[p. 179].” Although Worthy published no 
empirical evidence to support his statements, 
and although his observations were based 
upon his experiences in a single company, 
his opinions are frequently cited by other 
authors to support their contention that flat 
organization structures produce better per- 
formance and better job attitudes than do tall 
ones. Worthy’s opinions represent direct con- 
tradiction to those put forth by the classical 
organization theorists (e.g., Graicunas, 1937) 
who contend that a tall organization struc- 
ture improves performance by allowing for 
close supervision and therefore complete 
understanding by the supervisor of the 
subordinate’s activities. 

Despite the fact that this significant dis- 
agreement about the relative advantages of 
tall and flat organization structures has pro- 
yoked considerable comment in the literature, 
the topic did not generate any empirical 
research until 1962. 


Attitudes 
Three studies have compared the job satis- 
factions of employees in tall organizations 
with those in flat organizations. Meltzer and 
Salter (1962) published a study that reported 
on the job satisfactions of 704 physiologists 
(in nonuniversity organizations). They clas- 
sified their questionnaire respondents by size 
of company (fewer than 20 professional em- 
ployees, 21-50, and 51 or more), and by 
number of levels of administration within the 
organization (1-3 levels, 4-5, and 6 or more), 
and found generally insignificant relationships 
between tallness or flatness and job satisfac- 
tion. Their results, therefore, did not confirm 
about flat structure pro- 


Vorthy’s theory ) 
pared petter job attitudes. However, it 
should be noted that Meltzer and Salter 


tudied research organizations of extremely 
cial size, and hence their results may have 
s A ; 


imi lity. 
limited gorey Porter and Lawler (1964) 


i i ip between the need 
i igated the relationship 

i of over 1,500 managers and the 
a of organization structure in which they 
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worked. As in the Meltzer and Salter study, 
managers were classified as working in either 
a tall or flat organization structure based 
upon the number of levels in the organization 
relative to its total size. The results showed 
no clear overall superiority of flat over tall 
organizations in producing greater need satis- 
faction among managers. However, two quali- 
fications to this general finding were noted: 
First, organization size seemed to have some 
effect on the relationship between type of 
structure and the degree of need satisfaction. 
In companies employing fewer than 5,000 
people, managerial satisfaction was greater in 
flat rather than in tall organizations. For 
companies with 5,000 employees and over, the 
picture was reversed with a tall type of struc- 
ture producing perceptions of greater need 
satisfaction. The second qualification was that 
the effects of organization structure on need 
satisfaction appeared to vary with the type 
of psychological need being considered. A tall 
type of structure was associated with greater 
satisfactions in the security and social need 
areas, whereas a flat structure was asso- 
ciated with greater satisfaction in the self- 
actualization need area. 

Porter and Siegel (1964) have replicated 
the Porter and Lawler study with one es- 
sential difference—the sample. The Porter and 


Siegel sample of close to 3,000 managers was ` 


an international sample representing middle 
and upper levels of Management in a wide 
variety of sizes and types of companies in 13 
countries. The results of the newer study gen- 
erally agreed with those found by Porter ‘and 
Lawler. Porter and Siegel found that for 
companies of less than 5,000 employees, flat 
structures produced greater managerial ‘nerd 
satisfaction, whereas for companies of 5,000 
and over, there was no difference between tall 


and flat structures in producing managerial 
need satisfactions. 


Behavior 


Meltzer and Salter’s (1962) study was the 
only empirical investigation that compared 
the job performances of individuals working 
in organizations with tall and flat structures. 
They found generally insignificant relation- 
ships between tallness and flatness and job 
performance (as measured by the publication 
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of research articles). Only in large-size Or- 
ganizations did they find a significant trend 
and there it was in the direction of greater 
productivity in tall than in flat organizations. 


Summary 


The evidence does not support Worthy’s 
(1950) sweeping generalization that a flat 
organization structure produces greater job 
satisfaction and improved job performance 
The evidence points to organization size 35 
one of the factors affecting the relative a 
vantages of tall and flat organization strut 
tures. Two of the studies reviewed found that 
in relatively small organizations a flat 0% 
ganization did appear to be advantageous m 
terms of producing managerial job satisfac 
tions. However, for relatively large organizē 
tions one study found that tall organizati” 
structures produced greater job satisfactio™ 
and one study found that tall organizatio” 
structures fostered greater productivity- THis 
it appears that the advantages of a flat stru“ 
ture not only decrease with increasing organ” 
zation size, but that in relatively large % 
ganizations a flat structure may sometim 
even be a liability, e 

One explanation for the tendency for n 
advantages of a flat organization struct? 
to decrease with increasing organization Siy 
can be found in the characteristics gene™ < 
attributed to tall and flat organization st 
tures. Advocates of flat organization St" r 
tures claim that because of the large a 
age span of control subordinates will P% 
Sreater freedom and autonomy to make © 
cisions. As a result of this relatively 8° |. 


u 
freedom and autonomy, individuals aré a 
posed to contribute more to the organiza sir 
and receive a 


Ove greater satisfaction from My 
S Un the other hand, a tall structute g 


è 1 
creases supervisory controls and allows 5" 
riors to better 
sub 


lem 


job 
eit 
. Coordinate the activities O° © a 
ordinates, In a small organization P do 
S of coordination and communicatio” be 
not tend to be severe, simply becaus® gr 
organization is small. Thus, in a smā ip 
Sanization there would be little advantae® d 
a tall structure and, in fact, since it A 
to amplify the disadvantages associate ot 
tight managerial control, a tall structure Pir 
ably is a liability in a typical small orga” 
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tion. In large organizations, on the other hand, 
problems of coordination and communication 
are complex. Thus, for large organizations a 
taller type of structure may be needed to 
overcome these problems and allow man- 
agers to supervise their subordinates more 
effectively. 

Future research will undoubtedly show that 
other variables such as type of company—for 
example, retail trade versus manufacturing 
firms—will have an important bearing on 
Which type of structure is more advantageous 
a terms of job attitudes and job performance. 
W orthy (1959) himself has pointed out that 
different degrees of flatness may be desirable 
depending upon the type of organization 
considered. 


SHAPE: CENTRALIZED OR DECENTRALIZED 


is be contemporary trend in large-scale organization 
a ward decentralization, but we must realize that 
centralization is several things to different people. 
ts are those who view it entirely in terms 0 
a ion-making; others see it from the standpoint 
and oSrâphical dispersion of plants and installations; 
Poio others approach it as a philosophy of cor- 
batea life, a set of organization values with socio- 
ESR Psychological, and spiritual facets (Pfiffner & 
Twood, 1960, p. 190]. 


ia is no reason to believe that these dif- 
a ent types of decentralization produce simi- 
t effects on job attitudes and performance. 
ab Us, the first point that should be considered 
out any study concerned with the effects 
tain a tralization is the measure of decen- 
zation that is used. 
s ating to Chandler (1956), the move- 
20 to decentralization can be dated to the 
S and the management policies of Alfred 
ieee Jr., at General Motors. That the 
air rh decentralization is currently mm the 
numb clearly demonstrated by the great 
Cally er of recent articles that have dogmati- 
ate Ga forth plans for a decentralized corpo- 
Run. of life (see Cordiner, 1956; Ginzberg 
19 ee 1957; Lawrence, 1958; Smith, 
Offerin, Most writers have claimed that by 
regardi increased autonomy for the individual 
ecent: ess of his level within the company, 
ralization improves both job attitudes 
ralia ormance. Since the topic of decen- 
ion has received considerable attention 


for the last forty years, it is not surprising 
that there has been a certain amount of 
research on the topic. However, most of the 
research on the effects of decentralization has 
been of the case study variety (e.g., Drucker, 
1946; Given, 1949; Selznick, 1949), and 
therefore not within the scope of this review. 
In fact, only four studies were found that 
compared the job attitudes and behavior of 
employees in centralized and decentralized 
companies. 


Attitudes 


A study by Baker and France (1954) 
compared the attitudes of managers in cen- 
tralized and decentralized industrial relations 
departments. Their classification of centralized 
or decentralized departments was based upon 
the level at which decisions relative to indus- 
trial relations were made. When they asked 
managers which type of structure produced 
the best intramanagement relations, managers 
who worked in companies with centralization 
favored centralization, and managers who 
worked in companies with decentralization 
favored decentralization. Baker and France 
further found that companies with decentral- 
ized industrial relations departments revealed 
no greater satisfaction among their plant man- 
agers than did those companies with central- 
ized industrial relations departments. Although 
this latter finding does not support the claims 
of those who favor decentralization, it is im- 
portant to remember that the study was 
concerned with decentralization in only one 
type of department, and the attitudes of only 
one level of management. Thus, it is impos- 
sible to know if the findings of Baker and 


France are applicable to the effects of 
ide basis. 


decentralization on & company-Wwi 

Litzinger (1963) has compared the atti- 
tudes of bank managers, who were under 
centralized management with those who be 
under decentralized management. His results 
indicated no clear attitude differences between 
the centralized and the decentralized groups 
of managers. However, the results are difficult 
to interpret because it is not clear from the 
article what measure of decentralization was 


used. 
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Behavior 


Two studies were found that compared the 
behavior of employees in centralized compa- 
nies with the behavior of employees in de- 
centralized companies. Carlson (1951) found 
that executives in decentralized companies 
spent only 6.3% of their time “taking de- 
cisions,” while executives in centralized com- 
panies spent 14.6% of their time taking 
decisions.” He further found that executives 
in decentralized companies spent less time 
giving orders (6.8% versus 13.87%) than did 
executives in centralized companies. Carlson’s 
finding that executives in decentralized com- 
panies gave fewer orders and made fewer 
decisions supports the claim that decentrali- 
zation can lead to greater autonomy at the 
lower levels of the organization. However, 
there is a problem with this study that makes 
the results difficult to interpret. It is not 
clear what Carlson used as his criteria for 
classifying organizations as centralized or de- 
centralized. Thus, it is impossible to know 
which particular type of decentralization 
Carlson’s findings support. His findings are 

further limited by the fact that he had only 
nine managers as subjects in the study. 

Weiss (1957) studied the relationship be- 
tween centralized and decentralized organi- 
zation structures and several measures of be- 
havior. He classified 34 companies as either 
centralized or decentralized based upon their 
answers to a 22-item questionnaire about the 
level at which decisions were made in the 
company. Weiss found no significant differ- 
ences between centralized and decentralized 
companies on any of the following variables: 
turnover rate, number of grievances, number 
of white collar workers, absenteeism, accident 
frequency, accident severity, and age of man- 
agers. It should be pointed out that although 
Weiss found no statistically significant dif- 
ferences, the trend on each of the factors 
considered was favorable to decentralized 
organizations. 


Summary 


The studies reviewed offer no clear support 
for the proposition that decentralization can 
produce either improved job attitudes or per- 
formance. Thus, Pfiffner and Sherwood’s 
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(1960) conclusion that decentralization must 
be accepted partially on faith seems com- 
pletely justified. The fact that two of the 
studies reviewed obtained a few significant 
results and that another showed interesting 
trends suggests that the topic is a meaningful 
one for research. Perhaps the chief obstacle 
to research in this area, though, is the lack 
of an adequate method for measuring the 
degree of decentralization that exists in a? 
organization. Until such a measure is de 
veloped the research evidence gathered will 
undoubtedly remain difficult to interpret. 

It is doubtful that future research will eve" 
prove that any of the three types of decen- 
tralization mentioned earlier are superior t 
centralization in all situations. Marscha’ 
(1959) has presented a model graphically 
illustrating that the efficiency of decentrali- 
zation of decision making varies greatly 
depending upon the type of business m 
which the company is engaged. Sloan (1963) 
has made a similar point by stating tha 
different degrees of decentralization appea!® 
to be optimum for DuPont and Genera 
Motors because of their different product 
lines. Thus, for companies in a business wher? 
there is a high degree of complementarity 
among the members’ actions (e.g, airlines ° 
railroads), such that one member’s actio 
depends directly upon what his fellow en 
ploy ees are doing, a high degree of centrali2# 
tion is crucial. However, for companies A 
business where there is a great deal of iP e 
pendence among the action of the membe" 
(e.g., chains of retail stores), such that 0%, 
member’s action does not depend on what °° 
fellow employees are doing, increased dece” 
tralization may be desirable. 


CONCLUSIONS 


In this final section, two broad questio” 
will be considered: First, what is the statt” 
of our current knowledge concerning the poe 
sible effects of structure? Second, whe 
should future research efforts in this area P 
directed? 

With respect to the first question, a review 
of the literature demonstrates the followin’ 


. iga 
_ 1- Five of the seven properties of organi, 
tion structure (span of control and centr? 
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zation/decentralization being the two possible 
= oes have been shown to have some 
ind of significant relationship to either job 
ee or job behavior, or to both of these 
e es of variables. However, as we stressed in 
os introduction, experimental “proof” of 
A e-effect relationships between structure 
and employee attitudes and behavior is elusive 
almost nonexistent. 
a Certain structural variables seem to have 
avi ger relationships to attitudes and be- 
ior than other structural variables. On the 
it f the evidence to date, the two proper- 
tionshi structure that have the strongest rela- 
“de a with, or effects on, the two types of 
an ndent” variables are two suborganiza- 
unit a operties: organizational levels and sub- 
connectig Level was shown to have a definite 
sul ade a with several types of attitudes, and 
Strong] size was found to be associated 
and oY with both job satisfaction attitudes 
Prete types of behavior. Three other 
Organi les—line/ staff type of position, total- 
entea a size, and tall/flat shape—ac- 
Severa] for some significant relationships to 
Stren of the dependent variables, but the 
8th and clarity of their relationships were 
tional 1o as was the case with organiza- 
three f evels and subunit size. Each of these 
aS ext, ormer variables has not been researched 
ang , Sively as have the latter two variables, 
will i it is possible that future results 
relatio Ow them to have somewhat stronger 
the oe than appears to be the case at 
9 anna E The other two variables, span 
have os and centralization/decentralization, 
related rm r a been found to be sient 
o er, tative, Wa or behavior. Again, 20 A 
ut y little research has been carrie 
these two structural variables and they 
infiuence be found to have more important 
at tha CS On employees than seems apparent 
moment, 
ertain dependent variables are more 
related to properties of structure than 
amn „kinds of dependent variables. For 
be pole, indices of need satisfaction seem to 
ria Pro; oe strongly correlated with struc- 
Portance oni than are indices of need ii 
ologi hig makes good sense logically an 
hoa T , in that satisfactions presum- 
be determined more by environ- 


high 


l 
are y 


Syc 
ably 


mental factors (e.g., the degree of job re- 
sponsibilties, supervisory behavior toward 
subordinates, size of work group, etc.) than 
by personal factors (e.g., degree of extro- 
version), whereas employee wants and desires 
are likely to come about more from personal 
factors than from aspects of the work environ- 
ment. On the behavioral side, absenteeism and 
turnover seem somewhat more clearly related 
to structural factors than does employee out- 
put per se. The reasons why this should be so 
probably revolve around the fact that output 
or performance rates seem to be much more 
complesly determined phenomena than are 
absenteeism and turnover rates. That is, the 
latter types of behavior are within the em- 
ployee’s repertory of potential actions; how- 
ever, all levels of output are not within his 
repertory of actions both because of personal 
limitations and because of possible environ- 
mental factors such as work-group pressure 
and equipment limitations. Therefore, if 
structure affects various aspects of behavior 
largely through its influence on employees’ 
motivations, absenteeism and turnover rates 
will be more directly affected than will output 


levels. 


In general, the impact of structural vari- 


ables appears to be clearer on attitudes than 
on behavioral variables. However, this con- 
clusion should be interpreted with caution 
because 2 search of the literature reveals that 
for most of the structural properties some- 
what more research has been carried out on 
their relationships to attitudes than to types 


of behavior. 


4, The direction of relationships of certain 


structural variables to certain dependent vari- 
ables seems clear. The following tert 
relationships seem to be well supported by 


the research evidence to date: (a) organiza- 
ional Jevels—positive relationship between 
he level and degree of job and need 


height of iti ionship between 
i i ositive relationship bí 

ee ial and perceived necessity for 
hog b behavior; (b) line/ 


i irected type of job r; (b) 
mner erarchies—positive relationship be- 
s 


a ition and degree of 

n line type of position ar g 
TA satisfaction; (c) subunit size—negative 
T een size and degree of job 


elationship betwe 3 ; \ 
Satisfaction, positive relationship between size 
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and absenteeism rates, positive relationship 
between size and turnover rates. 


For certain other structural variables, the 
direction of their impact on certain dependent 
variables is not clear, even though particular 
studies found statistically significant relation- 
ships. Thus, it is not at all certain whether a 
flatter type of organization structure is al- 
ways associated with greater employee satis- 
faction compared to a taller structure, or that 
a smaller total organization results in greater 
employee satisfaction than does a larger total 
organization. As will be pointed out below, 
the direction of relationship in each case 
seems to be affected by the interaction of 
some second structural variable. Also, the 
evidence on the relationship between subunit 
size and output shows a wide scattering of 
types of relationships, going all the way from 
negative through curvilinear to positive rela- 
tionships. Thus, it may be that subunit size is 
an important determinant of employee pro- 
ductive behavior, but that the direction of 
its effects depends on relatively local and 
particularized conditions. 

We now turn to our second question: 
Where should research efforts be directed in 
the future? 

First, we would suggest that future research 
investigations in this area must be addressed 
to more complex questions. It seems evident 
that a great deal more attention has to be 
given to the possible interrelationships be- 
tween and among different organization struc- 
tural variables than has been the case so far. 
There is already some indication in the lit- 
erature that such interactions among two or 
more properties or dimensions of structure 
are likely. For example, the available evidence 
indicates that in order to determine the effects 
of either subunit size or total-organization 
size, one must specify or take into account 
the organizational level or levels being consid- 

ered. It seems probable that certain effects of 
size, either subunit or total, that are present 
when rank-and-file workers are the objects of 
investigation may be greatly reduced or pos- 
sibly even reversed at management levels, 
especially middle and upper executive levels. 
With respect to a different structural variable, 
tall/flat organization shape, the available data 
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indicate that its relationship to job satisfac- 
tion is modifed considerably by the total size 
of the organizations being studied. In other 
cases there are, as yet, few data to support 
(or deny) a notion of interaction effects, yet 
logic would lead one to predict the likelihood 
of this kind of situation. As just one example, 
it appears probable that success, or lack of 
it, in cases of deliberate organizational de- 
centralization may well depend upon which 
organizational levels are to be involved in the 
decentralization and upon such other intra 
organizational structural variables as the spa” 
of control and sizes of subunits created by the 
change. 

To sum up this point, there are already 
enough indications in the literature to support 
a greater research effort to investigate the 1” 
teractions among structural properties of 0"; 
ganizations in their relationship to employees 
job behavior and attitudes. Too much prev” 
ous theorizing in the area of organizations 
has neglected such interaction possibilities 
and hence there has been an unfortunate 
tendency to oversimplify vastly the effects Q 
particular variables, Organizations appear 0) 
be much too complex for a given variable t° 
have a consistent unidirectional effect acros 
a wide variety of types of conditions. d 

Second, there is an obvious need for vo 
experimental studies, as well as the mO 
typical field comparative studies. We need, ' 
other words, investigations where one or mor” 
structural variables are systematically manip 
ulated in field, not laboratory condition® 
There are admittedly great practical difficult a 
im carrying out such investigations, yet W% 
the study of organizations occupying suc 
portant place in the behavioral scienci 
today, it is time that researchers in this 4° 
faced up to the necessities for this kind 
undertaking. Perhaps this will require proje® 
of greater scope and support than in the pers 
and most certainly it is likely to require 5 


cross-discipline approach. But such proje 
are definitely not out of z 


k the realm of | he 
Possible. As examples, one can conceive W! 


out too much difficulty of the possibilities S 
manipulating such variables as span of c9 p 
trol or size of subunit. The results of a 
studies would provide firm support for 5° 


| 


PROPERTIES OF ORGANIZATION STRUCTURE 49 


aspects of organization theories, where such 
support is largely lacking at the present time. 
Third, longitudinal studies are probably 
necessary to draw valid conclusions concern- 
ing certain structural variables. Results from 
reo so far carried out on the relationships 
wae level or line/staff hier- 
lintati to job attitudes are subject to the 
iche ion that any obtained relationships 
types P part be due to differences among 
ls people rather than to differences 
5 types of positions within organizations. 
the proinne method of sorting out the rela- 
ae: ects of these two types of variables, 
ih a individual and organizational, would 
seins conduct longitudinal studies where the 
fee individuals are followed in their careers 
anther kind of organizational position to 
spicuow, Such longitudinal studies are con- 
Gia by their rarity in the literature on 
eas structure. 
tm we feel that the importance 
tire te to the study of organization struc- 
E fut not misplaced. Increased attention in 
Sf or ure to research on structural properties 
tg should improve our under- 
ng of the way people think and behave 


w d bi 
ga en they function in their jobs within or- 
nizations, 
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EXPERIMENTAL ATTEMPTS TO PRODUCE 


PHENYLKETONURIA IN ANIMALS: 
A CRITICAL REVIEW? 


RATHE KARRER anp GLENN CAHILLY ? 


Research Department, Training School at Vineland, New Jersey 


Studies attempting to produce PKU in animals are extensively reviewed. While 
some studies claim to have produced and consequently even prevented PKU in 
animals, such claims were found to be unjustified at the present stage of research. 
Behavioral and biochemical criteria for assessing PKU are discussed along with 


certain methodological problems inherent i 
duction of PKU, on the other hand, offer. 
of the relations between development, intel 


Phenylketonuria (PKU) is considered an 
“inborn error of metabolism” produced by an 
autosomal recessive gene. The primary defect 
is known to be a deficiency of the liver en- 
zyme phenylalanine hydroxylase which results 
in reduced conversion of phenylalanine to 
tyrosine with a consequent buildup of phenyl- 
alanine and its degradation products. The de- 
fect does not become apparent until shortly 
after birth when production of the enzyme 
normally begins. At this time the brain is 
undergoing its most rapid rate of biochemi- 
cal differentiation. This biochemical syndrome 
usually results in severe mental deficiency by 
as yet unknown processes. There is evidence 
that the defect can be controlled and the re- 
sultant mental deficiency averted if the pa- 
tient is placed on a diet low in phenylalanine 
within the first months of life. As Knox 
(1963) points out, the situation is one of ab- 
normal growth and development in an abnor- 
mal chemical milieu rather than one of toxic 
effects of abnormal metabolites, Thus, the 
question concerns the nature of the alterations 
induced by the abnormal chemical milieu in 
the developing brain rather than the effect of 
abnormal metabolites on the normal, devel- 
oped brain. 


1 The authors would like to thank George Rendina 
for initiating the interest in this area and for many 
informative discussions of the problems. Thanks are 
also expressed to Johs. Clausen for his cooperation 
and pertinent comments in the organization of the 
problem, and to Ethel Weiss for comments on the 
manuscript. 

2Now at Worthington Biochemical Corporation, 
Freehold, New Jersey. 
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n such work. The experimental pro- 
s an important model for the study 
lectual functioning, and biochemistry. 


Although PKU occurs relatively rarely ie 
proximately 1:10-25,000) and accounts 
less than 1% of mental retardates, it has a 
tracted much attention and research of nye 
Its importance is primarily as a model for 4 : 
study of the relations between genetics, few 
chemistry, and behavior, Within the past 
years there have been a number of attemp i 
to experimentally produce PKU in anima, 
(cf. Waisman, 1963) and thus bring these ipe 
lations under experimental manipulation. ~ g 
realization of this goal is of some importa?” 
not only to those concerned with menta the 
tardation but also to those concerned with ns) 
interrelations of normal cognitive funct is 
biochemistry, and development. Since : 
Work is perhaps less known outside of me? il 
retardation research, the present pape! ‘ 
review in some detail and try to evaluate C" ly 
cally these experimental studies. According, 
with few exceptions, only studies whose oat 
Purpose was to attempt to produce A 
judged by biochemical iea A or peha“ 
ioral criteria will be included. ye 

Attempts to produce PKU in animals a 
utilized three methods: (a) the admin's” o 
tion of large doses of phenylalanine alo”®, ») 
m combination with other amino acids; ^pe 
the administration of presumed phenyla' arse 
hydroxylase inhibitors, and (c) by selec ye 
breeding. All of the methods attempted vial 
ate a condition of excess plasma pheny aod 
nine, excretion of phenylpyruvic acid, pe 
retardation, the most striking features © ef 

uman condition. It should be noted, how® ay 
that these specific biochemical features 


the 
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or may not be directly involved in the result- 
sie dysfunction. Of further significance 
ws Rt that these studies begin with ani- 

aving normal phenylalanine hydroxyl- 


a trii : 
ns activity in contrast to the human condi- 
n, 


STUDIES ON RODENTS 


an and his colleagues at Wisconsin, 
H as Wassmuth in Germany, were the 
Stone pose the question of experimentally 
ee icing PKU and thereby making possible 

Xperimental study. 
ee Waisman, and Wyckoff (1958) 
iodur that perhaps administration of the 
sor E na of the hydroxylation re- 
cumul F phenylalanine could result in an ac- 
nanao o of substrate and elevated phenyl- 
e og ra in the organism. They began a 
addin of experimental studies on this basis by 
and eo amounts (59) of L-tyrosine, 
the di -phenylalanine alone or together to 
initial e of lactating or pregnant rats. While 
suckli mortality of the weanling, newborn, or 
acid es animals was a problem, either amino 
(2.5 alone, or together in reduced quantities 
result 3.75%), allowed normal growth. The 
drox i were inhibition of phenylalanine hy- 
atit activity to about one-third normal 
e Yy as well as more phenylpyruvate 1m 
R a which reached maximal levels 1 
ever after start of the diet. There was, how- 
$ little effect on plasma phenylalanine lev- 
their his was interpreted as substantiating 
hypothesis that Z-tyrosine, as well as 
Phenylalanine, inhibited phenylalanine 

TOxylase activity. 
Y operant conditioning of licking a dry 
kedi reinforced by water on @ 2-minute 
Sess — schedule, they attempted to as- 
impair ether their animals had any behavioral 
Sine ede Using animals on 2.5% L-tyt0- 
With w. DL-phenylalanine diets and controls, 
determiner deprivation as the motivator, they 
Xtent ri from the cumulative records the 
licking to which each group increased their 
criminatig 2-minute cycles—‘“‘temporal dis- 
Increased n, „They found the control group 
Minute ; their licking for 82.9% of the 2- 
Creaseq intervals, while the diet groups ™- 
Since se 46.8% (significant by ¢ at .01). 
ere was no pair-fed control group, it 


hya 


dip 


is impossible to determine whether this be- 
havioral deficit was a direct consequence of 
the PKU diet or a result of simple malnutri- 
tional weakness. Although phenylalanine hy- 
droxylase activity of tested animals was di- 
minished to one-third normal, the plasma 
phenylalanine was raised only slightly (1.6 
mg/100 ml to 2.8 mg/100 ml). Thus the 
biochemical criteria for PKU were not ade- 
quate. It should be noted that recent data 
(Waisman, 1960) indicate that L-tyrosine is 
unable to raise phenylalanine levels by inhibi- 
tion of phenylalanine hydroxylase and, in 
fact, increases the enzyme’s activity. 

Neill and Langford (1961) reported no suc- 
cess in simulating PKU biochemically in rats 
by 18 days on a 2.5% L-phenylalanine or a 
2.5% L-phenylalanine plus 2.5% L-tyrosine 
diet. Similar results were obtained for dosages 
of 3.75% L-phenylalanine plus 3.75% L-ty- 
rosine for 33 days. Neither were they able to 
simulate PKU biochemically with a diet of 
1% beta-2-thienylalanine (a phenylanine hy- 
droxylase inhibitor) plus 1% L-phenylalanine 
for 3 weeks. These results are directly counter 
to those reported by Auerbach et al. (1958). 

Yuwiler and Louttit (1961) and Louttit 
(1962), investigating the effects of a phenyl- 
alanine diet on brain serotonin and learning, 
used weanling rats divided by split litters into 
control and experimental groups. Dosages of 
phenylalanine administered in the diet ranged 
from 5-10%. The groups were run on an open 
field test at 46 days of age and placed on a 
23-hour food deprivation at 49 days of age. 
They were then tested on the Hebb-Williams 
series of mazes and on a black-white succes- 
sive discrimination with the basic control diet 
plus sugar as reward. After 21 days the ani- 
mals were decapitated and the brains ana- 
lyzed for serotonin. It was found that the 
phenylalanine animals had significantly lower 
serotonin (.468 ngm/gm tissue VS. 584 pgm/ 
gm) than controls. It was also reported that 
the behavioral tests were “in the direction of 

Hebb-Williams mazes 


greater errors on the bb- f 
(Louttit reports this as significant using an 
increased N) and fewer trials to criterion on 


the successive discrimination problem for the 
phenylalanine animals. There were no differ- 
ences on the open field test. Louttit concludes 
that the results do not demonstrate “mental 


) 
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deficiency because of the better discrimination 
performance and the possibility that poor 
maze performance may also be due to in- 
creased exploratory behavior.” One must agree 
with him, since in addition to the possible 
motivational differences there were no bio- 
chemical analyses of serum phenylalanine or 
urine phenylpyruvate. Thus none of the indi- 
cators of PKU were obtained. 

Huang, Tannenbaum, Blume, and Hsia 
(1961) studied the metabolism of 5-hydroxy- 
indole compounds in weanling rats fed the 
following three combinations of phenylalanine 
and/or tyrosine: (a) 3.75% L-tyrosine plus 
3.75% DL-phenylalanine for 2 and 3 months; 
(b) 3.75% L-tyrosine and 2.5% DL-phenyl- 
alanine for 2 months; (c) 2.5% DL-phenyl- 
alanine for 2 months. Urine samples were 
taken weekly and blood samples by cardiac 
puncture at 6.5, 7, and 9 weeks after start of 
the diet. The urine was analyzed for phenyl- 
pyruvic acid and 5-hydroxyindolacetic acid, 
blood plasma for phenylalanine and 5-hydrox- 
ytryptamine (serotonin), and the liver for 
phenylalanine hydroxylase activity. Blood 
plasma phenylalanine and urine phenylpy- 
ruvic acid of the experimental animals were 
higher than controls for diet No. 1 and mark- 
edly higher for No, 2, while there was little 
difference for No. 3. Similarly, 5-hydroxy- 
indolacetic acid and 5-hydroxytryptamine 
were decreased. Return to a normal diet re- 
versed these effects, They concluded that 
PRU can be induced by DL-phenylalanine 
plus L-tyrosine, while DL-phenylalanine alone 
gives only a slight effect, It is difficult to ac- 
cept this conclusion since no pair-fed controls 
were used, the levels of plasma Phenylalanine 
were low (equivalent to 4.24 mg.% on diet 
No. 2), there were no behavioral indices, and 
the effects observed were reversible, 

Loo, Diller, and Owen (1962) fed pregnant 
rats 1% DL-phenylalanine and then placed 
the pups on the same diet at 4 weeks of age, 
At 16 weeks of age they were given seven ses- 

sions of 160 trials each on a conditioned 
avoidance task. Blood from the tail was ana- 
lyzed for amount of keto acids and for 
phenylalanine. No difference in keto acids 
were found between experimental and control 
groups, but phenylalanine level was Teported 
to be significantly different (61 5 and 50 
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=3 pgm/ml for phenylalanine and control 
groups, respectively, p = 0.1). This is euri 
lent to 6.1 and 5.0 mg.%, hardly comparab 
to levels found in human PKU. The phenyl 
alanine group also failed to learn the condi- 
tioned avoidance task. F orty-five days afte) l 
the first test they were given a second arg 
ing session after having been on a normal dig l 
for 20 days. Four of the six phenylalanine an! — 
mals then succeeded in learning the task, a” 
their blood phenylalanine levels were norn y 
With such small amounts of phenylalani 
and no pair-fed controls, it is difficult to 4 i 
cept these results as directly due to phenyä À 
alanine. The nonpermanence of the effec 
when placed on a normal diet at adulthoo | 
also argues against a PKU defect. 62) | 
McKean, Schanberg, and Giarman (19 2) 
and Schanberg, McKean, and Giarman (198 g 
reported unpublished data that rats Hr 
phenylketonuric” by a diet containing 3- 5 | 
phenylalanine and 3.5% tyrosine for 14 0% 


had a greater number of errors and slow | 
performance in swimming a water maze E 
“high circulating levels of phenylalanine @ í 
tyrosine.” They also reported the reversa 

this effect when the phenylketogenic diet W 
supplemented with 5% tryptophan or, en 
glucose. No further information was A 
with which to assess the validity of the Je- 
havioral Measures or to determine the 2 
quacy of the biochemical criteria. ddi- 

_ Wang and Waisman (1961) obtained 4 
tional information on effects of feeding ly 
phenylalanine, since the D-isomer previous 
used was recognized as being deaminated iye | 
D-amino acid oxidase in the kidney to 2. ne‘ 
large amounts of Phenylpyruvic acid in ri 
urine without the metabolic byproducts met 
nine Ther fed 0, 3.5, S, or 71% LPM 
nylalanine in a Sround commercial chow tO", 
day-old Weanling rats for 3 weeks. In a0 o | 
tempt to manipulate feeding habits so 4° l 
maintain more uniform phenylalanine se", 
levels, some Tats were kept under cyclin’ 
hour light and dark conditions. The rats one 

5 or 7% L-phenylalanine diet showed ra 
effect of the light conditions, apparently H 
ing half that of normals at more frequent jon 
tervals regardless of lighting. This reduc" to d 
of intake was considered as probably due né 
L-phenylalanine. Biochemically they f0" 
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that 3.5% neither affected growth nor ap- 
preciably influenced plasma phenylalanine 
levels or gave increased phenylketone excre- 
tion. Five percent seriously affected growth 
and resulted in elevated plasma phenylala- 
nine, especially in light-dark housed animals, 
but gave inconsistently increased excretion of 
urinary phenylketones. Seven percent L-phe- 
nylalanine even more adversely affected growth 
(10% mortality in some groups) and resulted 
in plasma phenylalanine levels comparable to 
PKU humans (range from 12-70 mg.%) as 
well as consistently raised urinary phenyl- 
ketones. It was also found that regardless of 
Phenylalanine dosage, the phenylalanine hy- 
Toxylase activity of the liver was reduced 
40-60%. It is indicated that L-phenylalanine 
dosage is more important in attaining high 
Serum levels than duration of the experimen- 
tal or light conditions. While no behavioral 
ata are given, such work is reportedly in 
Progress, 

Boggs, Polidora, and Waisman (1962) re- 
Ported that pregnant and lactating rats fed 
ani containing 5 and 7% L-phenylalanine 
slant apparently able to adapt to high phenyl- 
=a in that almost normal plasma levels 
thar regained after several weeks. They noted 
was While the free phenylalanine in the milk 
was 20-40 times higher than normal, there 
nin no significant effect on plasma phenylala- 
acti levels or liver phenylalanine hydroxylase 

i he This was attributed to a relatively 
i Protein level of rat milk which makes a 
in ee increase in phenylalanine insignificant 
Was of total phenylalanine. Bogss and 
Rea (1962) report data indicating in- 
ra plasma levels in suckling pups es 
Plus s fed 7% L-phenylalanine Or 4% L 
Was — L-phenylalanine. Plasma tyrosine 

ands so found to be low in the 7% L-pheny™ 
Com ne group. These plasma levels were not 

Siren to human PKU and no phenyl- 

ais es were detected in the urine. Boggs and 

man (1964a) reported the elevation of 


pl 
alan ma phenylalanine and lowered phenyl- 


m hydroxylase activity in pregnant fe- 
nie albino rats fed 5 and 7% Z-phenylala, 


Waste e was, however, large individual 
rom An Similar determinations in fetal rats 
en E ca Pregnant females indicated plasma 
Ylalanine levels comparable to human 


PKU, but again with great individual varia- 
tion (mean plasma phenylalanine and stand- 
ard deviations were 4.9 = 1.0 and 30.5 = 28.3 
for control and 7% phenylalanine groups, re- 
spectively). The fetal plasma phenylalanine 
levels remained about twice that of the dam 
regardless of the latter’s level. They reported 
no gross abnormalities in such pups regardless 
of the increased fetal plasma phenylalanine, 
but feel that it is still likely that in utero 
brain damage did occur. Boggs and Waisman 
(1964b) reported higher plasma phenylala- 
nine and tyrosine, lower brain serotonin, and 
higher urinary phenylketones in rats fed 5 or 
7% L-phenylalanine from weaning to about 
50-60 days of age compared to pair-fed or 
ad lib controls. Phenylalanine-pyruvate trans- 
aminase activity was apparently increased 
significantly in the pair-fed groups (¢ = 3.73, 

p < .001, calculated by the writer), but not 

in the phenylalanine group compared to the 

ad lib group. Phenylalanine hydroxylase ac- 

tivity was again shown to be reduced about 

50% when compared to ad lib controls, but 

no data is given to allow comparison to pair- 

fed controls. There was an adaptation-like ef- 

fect in that there was little difference between 

the groups on these measures at 80-90 days. 

This series of studies has had limited success 

in producing the biochemical correlates of 

PKU in rats and no behavioral data are pro- 

vided. 

Polidora, Boggs, and Waisman (1963; also 
Boggs & Polidora, 1963), in the most com- 
plete report of behavioral assessment to date 
from the Wisconsin group, described a be- 
havioral testing procedure which they claim 
to be reliable in detecting impaired maze per- 
formance of PKU rats. Since they recognized 
that a task using & food- or water-motivating 
system may confound the situation, they 
J from water as a motivator in a 


chose escape 1 
water maze. They used a 6-unit, water-filled 


T maze (cf. Schanberg, McKean, & Giarman, 
1961) and 21-day-old rats on a 5% L-phe- 
nylalanine diet for the first 10 days followed 
by 1% (cf. Wang & Waisman, 1961) until 
the end of the experiment. Control rats re- 
ceived the same commercial diet without L- 
phenylalanine. Behavioral testing was done on 
three different groups aged 72, 87, and 90 
days. After training in a 50-inch straight 
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channel (5 trials) on Day 1 (pretest), the 
rats were given 7 trials per day (Days 2, 3, 4) 
in the maze followed by another 5 trials in 
the straight channel (Day 5, posttest) com- 
parable in length to the maze (100 inches). 
Successive trials were spaced 10 minutes apart. 
Time between start and finish was recorded 
as well as “transit speed” (linear extent/ 
elapsed time) and errors (whole body cul 
entry). Transit speeds were found to be 
slower on the posttest for the experimental 
groups. The authors state that this is due to 
more backtracking and circling by the experi- 
mental rats which, when taken into considera- 
tion, gave swimming speeds “practically 
identical for the two groups.” Analysis of 
variance of transit speeds as well as errors on 
the maze trials (Days 2, 3, 4) indicated a 
significant difference between experimental 
and control groups. This was complicated by 
significant effects of trials, days, and trials- 
by-days interaction. In essence, the experi- 
mental groups performed more slowly and 
made more errors. The slower performance 
resulted mainly from their higher error rate, 
slowness in gaining exit from a cul, and a 
tendency to turn in the wrong direction upon 
exit. It is also reported that the experimental 
rats had 38% lower body weight and infre- 
quent mild alopecia, although there was no 
apparent muscular weakness. While a clear- 
cut behavioral deficit seems to have been dem- 
onstrated, it is not apparent whether it is the 
result of excess L-phenylalanine per se rather 
than from a self-imposed starvation or mal- 
nutrition. Such a demonstration requires more 
adequate controls such as pair-feeding and 
“innocuous” control diets, resulting in similar 
self-imposed starvation. 

In this respect it is of interest that Boggs, 
Rosenberg, and Waisman (1963) have re- 
ported that feeding 5 or 7 % L-phenylalanine 
caused a decrease in brain serotonin even com- 
pared to pair-fed controls. A decrease in liver 
serotonin was found in both groups and at- 
tributed to the restricted food intake. The 
brains of the pair-fed controls, however, were 
able to maintain normal serotonin levels while 
the liver levels decreased, a result which is 
not understood. 

Perez and Schmidt (1963), in a pilot study 
using albino rats and intraperitoneal injection, 


administered 1.0, 1.5, and 2.0 gm. of bole 
nylalanine per kg. of body weight dissolve 

in saline (2.5 mg/ml) to neonate rats from 
about 2 hours after birth for 60 consecutive 
days. Control rats received comparable vol- 
umes of saline. Ferric chloride urine tests were 
positive for the 1.5 and 2.0 mg/kg groups E 
early as 3 hours after birth. A significan 

cumulative depression of growth occurred duf- 
ing the phenylalanine administration but was 
reversible afterwards. Of great importance ÍS 
the mortality rate which ranged from 36% {oF 
the controls to 70% for the highest dosag? 
group. Behavioral measures were obtaine 

46-64 days after the last injection and after 
a 10-day adaptation to a 23.5-hour we 
deprivation schedule. Animals were trained Í 

a straight runway, 10 runs a day for 5 days. 
Each animal was then given 10 trials a dey 
in a Lashley III maze until reaching a ct 
terion of 9/10 errorless trials, They Tour 
that male experimental animals made sigri i 
cantly more errors and tended to require mor 

trials. While this method has been found : 
raise serum phenylalanine for about 10 how! 
after injection (peak of about 160 mg/ f 
after 2 hours*), more extensive recent 4 

tempts to replicate these behavioral rest 2 
have been negative (Perez, 1964).* It is ead 
gested that behavioral impairment found w 
ing phenylalanine administration may be 7 
to toxic effects of phenylalanine rather thal 
true PKU. One weakness of these results n 
the possible selective factor of mortality; 07 
the biggest and best survive. To be empha” 
sized, however, is that the behavioral testi”? 
was done after ample time to recover from 

specific administration effects with the pe 
nent rational that to adequately stimula : 
PKU the effect must be permanent. This 2, 
lows the use of a Motivator compatible W 

standard behavioral assessment techniq" 
without confounding the situation, but ¢ 

not obviate controls (difficult in this case) set 
determine the effects of the self-imposed i 


o c ak 
restrictions of the amino acid imbalance 4 
mals, 


tir 


. he 
3V. J. Perez. Phenylpyruvic oligophrenia 1n i 
rat. IT. Quantification of serum L-phenylala™ 
Personal communication, January 1965. othe 
4V. J. Perez. Phenylpyruvic oligophrenia in 
rat. I. Personal communication, January 1965- 
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Woolley and Hoeven (1963) briefly re- 
ported that weanling mice fed 3.5% DL- 
phenylalanine plus 3.596 L-tyrosine gave 
similar changes in urinary excretion of phenyl- 
pyruvic acid as shown by Auerbach et al. 
(1958) and in serotonin and catechol amines 
of the tissues as shown by Huang et al. (1961) 
and Wang, Harwalkar, and Waisman (1962). 
They also reported a slight increase in learn- 
ing ability measured by a T maze with escape 
from light as the motivator. Needless to say, 
this is counter to the results of Auerbach 
et al. (1958) for rats performing a different 
task, but similar to that reported by Louttit 
(1962) for discrimination learning. They es- 
plain this result by pointing out that the cru- 
cial period in phenylketonuria is early infancy, 
Whereas the animals used in the study were 
asalam They claim evidence (Woolley, 
aah that it is a serotonin deficiency induced 
d a in life which damages the intellect, while 

“ciency induced in adulthood somehow en- 
ances learning ability. It is of interest that 
“a report that the observed behavioral 
Pe ig were not permanent, the mice ob- 
ti ing normal scores several days after cessa- 
on of treatment. In another study (Woolley 
even, 1964a) they began force-feeding 
Phe within 24 hours of birth .01 ml. of DL- 
( Nylalanine (20 mg/ml) and L-tyrosine 
mg/ml) nine times per day. The dosage 
increased with growth so that at 2 weeks 
boxe they were receiving .1 ml/hr, at which 
force-feeding was stopped and 3.5% each 
th +-Dhenylalanine and L-tyrosine added to 
T At 7-8 weeks of age they were 
testeq on a normal diet for 3 days and then 
Com in a T maze as in the previous study. 
ne fee to two groups of control animals, 
diet Orce-fed water followed by @ nouma 
Dhen a the other reared normally, the 
” alanine-treated animals did more poorly 
Urine exhibited phenylpyruvic acid in the 
Chlorpy, ther animals given reserpine 3 
ciency Omazine to produce serotonin defi- 
îlarly during development performed sim- 
8roup hey report that the phenylalanine 
tended ta physically normal at maturity pi 
Studies o be irritable and less active. m 
"epor (Woolley & Hoeven, 1964b, 1964c 
toni ed that when serotonin cogeners (mela- 
and HTP) were administered con- 


ch 


mice 


Was 
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currently with the phenylalanine diet from 
birth, “the mental defect was prevented 
even though excesses of phenylalanine and 
. . . phenylpyruvic acid still remained in the 
tissues.” 

This is an impressive series of studies, hav- 
ing not only claimed the production of the 
mental defect of phenylketonuria in mice but 
also its prevention. The claim of mental de- 
fect is based upon trials to learn a T maze, 
which data are amazingly tight and consistent 
as reported but which are somewhat mislead- 
ing for assessing the significance between ex- 
perimental treatments (not reported). The 
standard deviations reported reflect the vari- 
ance of a population of subgroup means rather 
than the variance within each experimental 
treatment. This latter variance among indi- 
vidual mice is considerable, which would lead 
one to suspect the significance of the small 
mean differences obtained (generally about 
one trial in a test of maze learning, e.g., 7.4 
correct out of 10 trials for normals vs. 8.3 
correct out of 10 trials for phenylalanine 
mice). The fact that these authors are ap- 
parently able to consistently reproduce these 
small differences, however, indicates some real 
difference between treatments. McMillan 
(1964) has recently emphasized that the be- 
havioral effects were not permanent but re- 
stricted to the period of treatment, suggesting 
impaired performance rather than learning. 
That no analyses other than the presence of 
phenylpyruvate in the urine were done in 
order to assess better the biochemical simu- 
lation of PKU also weakens these studies, 
since the diet used has not been successful 
Sih rats (see above). The phenylpyruvats 
may very well be due to the nonphysiologica 

i which is converted in the kidney. 
it ig such as the lack of pair-feeding 
math pine a role. On the other hand, the 

g! J . 
maze task using light avoidance and the 
later use of shock-avoid 


ing. These 
replicated b 


lly evaluated. 
can Pe lt (1958) and Ammor (1961) te- 


t an attempt to produce PKU in a] 
a subcutaneous (261 mg.) or Intravenous 
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(131 mg.) injection of DZ-phenylalanine be- 
fore copulation, during pregnancy, and during 
lactation. Of 14 offspring, 2 died at birth, 3 
had partial paralysis and subsequently died, 
while the rest were described as “unruly” and 
“nervous” with unhealthy, “shaggy and 
patchy” fur, especially on the head. Even 
though their EEGs were normal and no sig- 
nificant increase of phenylpyruvic acid in the 
urine was found, the author concludes that 
these young exhibited “disorders comparable 
with those of human phenylketonuria.” With 
the use of the nonphysiological isomer, no 
controls, and scanty data it is difficult to ac- 
cept such a conclusion. 

An interesting study by Coleman (1960) 
Suggests another approach to experimental 
PKU. It was reported that certain strains of 
mice (DBA/1J and DBA/2J) were charac- 
terized by dilute Pigmentation, extreme ner- 
vousness, and seizures. These mice had a di- 
minished phenylalanine hydroxylase activity 
ranging from 14-86% of that of an unrelated 
strain (C57BR/cdj). Evidence was found for 
the presence of an enzyme inhibitor, possibly 
phenylacetic acid. Of further interest is the 
recent report by Wimer and Weller (1964) 
that the DBA/2J strains did poorer on a 
black-white discrimination task than a 
C57BL/6J strain. These studies are suggestive 
enough to make further research with 
proach appear promising. 

Watt and Vandevoorde (1964) have re- 
cently reported that rats fed .2% DL-4- 
fluorophenylalanine, a competitive inhibitor of 
phenylalanine hydroxylase, excreted approxi- 
mately five times more phenylpyruvic acid 
than controls. The use of enzyme inhibitors 
alone or together with excess Phenylalanine 
may be a productive approach, although there 
are apparently no known completely specific 
inhibitors of phenylalanine hydroxylase. 


this ap- 


STUDIES ON MONKEYS 


Waisman, Wang, Palmer, and Harlow 
(1960, 1962; Waisman, 1962) report a 
lengthy, ongoing study of four macaque mon- 
keys separated from their mothers as soon as 
possible and reared on surrogate mothers (cf. 
Harlow, 1958). They were given increasing 
dosages (up to .5 gm/100 ml) of L-phenyl- 
alanine from soon after birth for 15 months. 


The average daily intake of L-phenylalanine 
approximated 2.75 gm/kg body weight. Regu- 
larly determined blood phenylalanine indi- 
cated generally high plasma phenylalanine, 
but the consistency of elevation was quite 
variable in each animal (interpolations from 
their figures give ranges from approximately 
1 mg.% to 50 mg.%). The authors suggest 
that this frequent fall in level is probably due 
to poor food intake. Throughout the study 
the ferric chloride test, 2, 4-dinitrophenyk 
hydrazine test, or the Phenistix test as we 

as quantitative analysis, strongly indicate 

the presence of phenylketones in the urine. 
Behavioral evaluation of delayed reaction, 
object discrimination, the Hebb-Williamt 
maze series, and discrimination-learning a 
(cf. Harlow, 1959) indicated poorer es 

formance on delayed response and possibly 
the Hebb-Williams mazes and discrimina 

tion-learning set. Individual scores for hi 
four experimental animals are contrasted, api 
parently, to a mean score of an indeter™! 

nant number of normal controls. Tt would be 
important to know if the differences observe 

were significant in view of the variation amon’ 
the four experimental animals (there is r 
mention of normal variation). Nor is ther” 
mention of how the normal controls (PY° 
sumably reared under identical conditions p: 
isolation with surrogate mothers) would p 
form compared to normally reared monkey” 
In other words, what is the role of the reari”? 
conditions in the behavioral deficit on the 
tasks, especially since such conditions ê" 
known to produce deficits in other behavior” 
dimensions (Harlow, 1962)? Green and bi 
don (1964), for example, have recently dem 
onstrated that monkeys reared in isolati” 
show less interest in looking at such objects 

food, geometric forms, the mothers, peers» 

an empty chamber, Although no data on f0° 
intake is given for comparing normal and @ 
perimental animals, the weight gains seem al 
be comparable, implying that the experimen’ 
animals did not voluntarily restrict their vs 
as has been observed in other studies. Neve 

theless, paired-feeding would again have gi¥° 

additional validity to the study. It is of som 
interest that the authors noted grand- & 
Seizures in some of their animals (Waisman | 

Harlow, 1963), since about 25% of hum 
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PRUs also 
1963), 

Cadell, Harlow, and Waisman (1962) re- 
ported EEG changes in these monkeys raised 
on an L-phenylalanine diet. Resting records 
of phenylalanine animals showed a predomi- 
nance of slow waves, but none showed the 
3/sec spikes and waves reported by Fois, Ros- 
enberg, and Gibbs (1955) for human PRU. 
Normal controls showed predominance of high 
frequency activity. Some phenylalanine ani- 
mals had reduced following responses to inter- 
mittent photic stimulation. Thus there was 
little similarity to EEG activity of human 

KU. It would also be important to know if 
these monkeys and controls were raised on 
Surrogate mothers. EEG changes toward slow- 
ing have been reported for short-term sensory 
isolation even in adult humans (Zubek, Welch, 
& Saunders, 1963). 

Waisman, Hable, Wang, and Akert (1964) 
il also reported ultrastructural changes in 
ate cells and myelinization of brains of rats 
ala One monkey raised on an excess phenyl- 
g nine diet. Since previous reports from this 
oo have not adequately demonstrated 
P U in rats or monkeys analogous to human 
sug j these results must remain only highly 

gestive, 
isle these studies seem the most 
ng, the small N, large variability, lack of 
ta controls, and the possible contamina- 
ie by the rearing conditions requires further 
pee before concluding that PKU can be in- 
ced in monkeys.’ 


exhibit such seizures (Jervis, 


prom- 


PKU CRITERIA 


me disease was originally called phenyipy 
bioch oligophrenia in recognition of both the 
Chemical and behavioral characteristics. 
ee changed in recent years to phenyl- 
sige wi reflecting the biochemical empha- 
While biochemical investigation has 


5 
rae More complete report of this work has since 
doubts ished (see Waisman & Harlow, 1965). It is 

6 hil if it changes the evaluation. , 

Dossibl the two terms are used interchangeably, it 
entities e that they should be reserved for separate 

heny]; in order to avoid confusion, that 1$, 
s tnea aiie oligophrenia for the inclusive bio- 
Or the hon, behavioral syndrome; phenylketonur 
Symptoms, emical symptoms without the behaviora 
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grown rapidly, behavioral investigation of 
PKU has been neglected, consisting almost 
solely of intelligence testing and clinical ob- 
servation. There has been no experimental 
study of PKU abilities or behavior. Neverthe- 
less, it is obvious that PKU as a meaningful 
pathology entails a relatively specific bio- 
chemical syndrome accompanied by a rather 
gross behavioral syndrome labeled mental re- 
tardation. One without the other cannot be 
PKU in any meaningful sense. Therefore, ex- 
perimental attempts to produce PKU must 
use both adequate biochemical criteria and 
behavioral criteria if they are to speak of 
their results as PKU. There has been no study 
to date meeting this requirement. Many 
studies have, however, assumed adequate 
demonstration of PKU in animals submitted 
to certain procedures and have consequently 
accepted these procedures, thereby perpetu- 
ating and compounding the error. 


Chemical Criteria 

Since the original discovery of phenylpy- 
ruvic acid in the urine and increased plasma 
phenylalanine, followed by the finding of in- 
sufficient liver phenylalanine hydroxylase ac- 
tivity, a number of additional biochemical 
differences between PKU and normal humans 
have been found. Tt is necessary that experi- 
mental attempts to produce PKU increasingly 
strive to duplicate whatever biochemical dif- 
ferences are essential and sufficient for the 
condition. Presently these would seem to be 
the presence of both phenylpyruvic acid in the 
urine and a serum phenylalanine level above 
about 20 mg.% in conjunction with reduced 
phenylalanine hydroxylase activity. In addi- 
tion, serum tyrosine levels should be low, a 
result usually not found in the experimental 
studies thus far. It is also apparent that a 
general abnormality of tryptophan metabo- 
lism exists which may have a broad effect on 
brain development since it is a precursor of 
serotonin and even nicotinic acid, Sot of 
which have profound effects on the CNS and 
mental functioning. In this respect, a number 
of studies cited have demonstrated that excess 
phenylalanine results in decreased levels of 
serotonin in the brain or other tissues (see 
also, €g» Culley, Saunders, Mertz, & Jolly, 
1962; Hsia, Nishimura, & Brenchley, 1963), 
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It is possible that other compounds such as 
orthohydroxyphenylacetic acid, phenylacetic 
acid, phenyllactic acid, and even plasma amino 
acid patterns may be of direct importance for 
the pathology, in which case they should be 
included as indicators in experimental studies. 
Recently Boggs, McLay, Kappy, and Wais- 
man (1963) have demonstrated another bio- 
chemical similarity of the experimental con- 
dition to that found in PRU, namely, de- 
creased excretion of 5-hydroxyindolylacetic 
acid in monkeys, 

Some experimenters feel that the sine qua 
non of phenylketonuria is the presence of 
phenylpyruvic acid in the urine. While this 
was the first recognized index giving the dis- 
ease its name, it is doubtful if it should be 
considered the most basic characteristic of 
the larger entity, especially in the case of ad- 
ministering D-phenylalanine, It seems essen- 
tial to use the L-isomer since the D-isomer is 
a nonphysiological form and is not involved 


n to phenyl- 
pyruvate by the kidney—giving positive urine 


tests without yielding the other metabolic de- 
tivatives possibly involved in PKU. 
The picture seems further complicated by 


with the bio- 


Le Tea variant combinations) These 
uid indicate that the resentl i 
biochemical 4 ple se 


: ‘sponsible for 
cases presenting unusual syndromes, implying 
g 


that PKU may not be a homogeneous disease 
caused by a single defect, 


Behavioral Criteria 


Since there has been little behavioral re- 
search on PKU other than IQ testing and 
clinical observation, there are no specific be- 
havioral indices whose analogs could be used 
in animal work. With only the rather nebu- 
lous description of “mental deficiency” to go 
by, the task in animal experimentation is far 
from simple. It is of some help to note the 
clinical descriptions of PKU behavior such as 


hyperactive, irritable, unruly, schizophrenic- 
like, and the existence of seizures. These may 
be taken as possible guides, but they are pres- 
ent only in some cases and are also known to 
be present in many other pathological condi- 
tions. 

Ti “mental deficiency” is taken to mean im- 
paired learning performance and adaptive be- 
havior (social and environmental), its analog 
in animals should be measurable. Work to date 
has generally used performance on a single 
task (maze Performance, discrimination learn: 
ing, etc.) to determine “mental deficiency’ 
with the implicit assumption that performance 
on other tasks will be similar (Waisman et al, 
1962, is a notable exception here). This !$ 
unacceptable even by IQ-test procedure 
Where a range of behavior is tapped on 4 
number of subtests. This problem is a recut 
ring one when assessing general ability 1 
animals (cf, Rosenzweig, Krech, & Bennett, 
1960; Searle, 1949, on maze “bright” a" 
maze “dull” rats). Tt is possible that a co” 
cept of a general intelligence factor is les 
applicable to animal behavior than to human 
behavior. 

With no guide to the type of tasks that 
might be discriminative for PKU individuals 
it is necessary to start with a variety of tra 
ditional measures in animal behavior unt! 
appropriate information from human PKU # 
obtained. Certain tasks seem to have obvious 
Worth in this connection: the Hebb-Willia™® 
mazes, discrimination learning and revers®! 
and avoidance learning, to cite a few. An) 
tasks chosen must also be regarded in relation 
to the methodological problems discussed P 


low. 
METHODOLOGICAL PROBLEMS 

Many of the st 
ated by methodological problems. The pre 


Adequate Controls 


Only recently have basic controls such 4f 
paired-feeding begun to be used (cf. Polidor 
et al., 1963) with no behavioral results as Y° 
published. There is some question if pair 
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feeding is a sufficient control, since the amino 
acid imbalance in the phenylalanine-fed group 
1S not present in the pair-fed group. It may be 
better to produce a comparable amino acid 
imbalance by feeding a different amino acid 
not associated with mental deficiency. 

Studies using a particularly traumatic 
Means of induction (interperitoneal injection 
m neonates or very high dietary levels in 
Sestating or lactating females, etc.) also re- 
quire consideration of the selective factor of 
the high mortality which occurs. 

Further controls of environmental-develop- 
Mental effects need to be used in light of the 
Yeported effects on learning ability and brain 
Chemistry (Hunt, 1961; Krech, Rosenzweig, 

Bennett, 1960). There is too little known 
2 the complex interaction of development and 
Xperience on later learning behavior, while 

S known effects on social behavior suggest 
oa isolation-reared animals are more closely 

gous to mental defectives than to nor- 
in s. It is questionable, therefore, if isolation- 
red animals should be used to investigate 
€ effects of a third factor with as diffuse 
effects as PKU. 
in eae using a dietary method of naue 
cont here has been little concern for precise 
rol of Phenylalanine content of the stock 
consid, enerally used. This is known to = 
of erably and may be responsible for som 
€ variations obtained in results. 
th asiy, there has been little replication 0 
iochemical criteria and less replication 
© reported behavioral changes. Replica- 
and increased numbers of animals are @ 


Eci s 3 
essity to discount random procedural and 


i . 
logical variation. 


tion 
b 


Motivation 


The 
Studies 
p tivati 


assessment of behavioral changes 17 
of this nature is complicated by 
is Onal differences between the groups. 
Pro hg known that amino acid imbalance 
cific changes in dietary habits and, spe- 
a Yl that excess phenylalanine prodas 
hun misty decrease in food intake. Thus, 1 
Ser is used as a motivator, the control 
otiya perimental groups will be differentially 
fect ed. Pair feeding while controlling the 
owp > 9f reduced food intake introduces its 
Steki, ects of increased motivation (food- 
8) since the pair-fed group essentially 


is on a deprivation schedule. It is reasonable 
to assume that thirst is also subject to this 
reasoning on the basis of the interaction be- 
tween food and water intake. Many studies 
have consequently used some sort of avoid- 
ance or escape task to circumvent these prob- 
lems. The Wisconsin group’s use of escape 
from water in maze performance (Polidora et 
al., 1963), and Woolley and Hoeven’s (1963) 
use of escape from light offer promise at 
present. In addition, shock avoidance in a 
discrimination task or shuttlebox may be 
productive. Such avoidance motivation may 
not be without its own problems, for example, 
escape from light may be increased by height- 
ened skin sensitivity from excess phenyl- 
alanine known to decrease pigmentation and 
to give rise to eczematous conditions, or 
escape from shock may be confounded by 
skin-resistance changes produced by unbal- 
anced diets affecting the skin. These problems 
are avoided by allowing sufficient time to 
elapse after administration of excess phenyl- 
alanine so that such motivational changes can 
return to normal (cf. Perez, 1964). Human 
PRU, if not given therapy shortly after 
birth, results in permanent damage. There is, 
therefore, every reason to expect behavioral 
effects in experimental animals some months 
after cessation of phenylalanine excess if a 
true analog of PKU has been produced. 


Developmental Variables 


Since the specific lesion resulting in PKU 
occurs in early infancy when the brain is still 
developing and therapy is most effective if 
begun at this time, developmental factors are 
of utmost importance 1m attempts to produce 
PKU in animals. Many studies have hig’ 
with adolescent or young ac aa in 
which it is doubtful at the outset if a igos 
brain effects can be produced. aie, evel: 
mental differences seem responsible for s 


se of 
of the divergent results reported. The = Sy 
1 or neonatal animals would seem neces 
je ives rise to problems of inducing 
3 thods have been used to date: 
the pregnant or lactating 
i i the neonate, intra- 
special diet to 
female, T injection of the neonate, or selec- 
A nels. While the special diet to the 
emit or lactating female is most conveni- 
eit mortality of the young is sometimes high. 
; 


sary, but 
PKU. Four me 
special diet to 
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The fact that the enzyme phenylalanine 
hydroxylase becomes active, with some varia- 
tion, shortly after birth (Freedland, Krakow- 
ski, & Waisman, 1962; Kenny, Reem, & 
Kretchmer, 1958) should be emphasized in 
selecting the developmental stage with which 
to work. 


Species Differences 


Results must also be interpreted in relation 
to the specie upon which they have been 
obtained. It is obvious that the behavioral 
tasks chosen may have differing importance 
for the specie involved, for example, simul- 
taneous versus successive discrimination. For 
work with monkeys, the Wisconsin group (cf. 
Waisman et al., 1962) has chosen a reason- 
able range of tasks. Just as obvious, however, 
is the importance of species (as well as 
strain) differences in biochemical lesion, for 
example, phenylacetic acid metabolism, Bess- 
man (1964) has recently pointed out that 
biochemical differences may exist phylogeneti- 
cally which may “render inapplicable to man 
some of the observations on experimental 
PKU in the rat and monkey... .” Rats, for 
example, have 10 to 15 times more phenyl- 
alanine hydroxylase activity than do humans 
(Waisman & Harlow, 1963), making it ex- 
ceedingly difficult to produce an excess of 
phenylalanine, while monkeys have levels 
comparable to humans. Upon these considera- 


tions, primates would seem to be the best 
suited for this work. 


GENERAL CONCLUSIONS 


The production of mental retardation in 
experimental animals by biochemical, en- 
vironmental, or other experimental means 
provides an important tool for the study of 
intellectual ability. It is doubtful, however 
if this has yet been realized for PKU. It 
seems that claims for the production of PKU 
in animals have been premature. There have 
been no studies meeting the criteria discussed 
and also free of basic methodological errors, 
While many have demonstrated an increase in 
serum phenylalanine, it is generally far below 
the levels found in human PKU even though 
phenylalanine hydroxylase activity may be 
reduced 20-40%. Similarly, many studies 
have reported behavioral effects whose valid- 
ity and reliability are questionable. Some 


laboratories have now apparently begun to 
systematically investigate the conditions 
needed to adequately and reliably produce 
the biochemical criteria and the role of de- 
velopmental variables. The coupling of these 
endeavors with more adequate behavioral 
assessment should produce the informatio? 
now lacking in this potentially fruitful area 
Many studies have observed severe growth 
inhibition in their experimental animals. 
There is little evidence for reduced body size 
or weight in human PKU (Jervis, 1954 
1963) which may lead one to reason that 
the growth inhibition and mortality observe 
in animal studies is atypical and may reflec 
Processes different than true PKU but whi 
could easily result in performance decrements 
These processes may involve simple malnut!™ 
tion due to reduced dietary intake and amin? 
acid imbalance. The techniques used to PI 
duce excess phenylalanine require that it ha 
to be absorbed into the blood from the g" 
or peritoneal cavity with the possibility ° 
inhibitory effects of phenylalanine on acti” 
transport of other amino acids which may 
also produce a malnutrition state. In buma" 
PKU, by contrast, the excess is produce’ 
directly in the blood via the liver. There * 
evidence that malnutrition at certain stage 
of development can adversely affect behavio™ 
In this connection kwashiorkor, a general pre 
tein deficiency widespread in South America 
and African-Asian countries and which = 
detrimental results on intellectual and beba” 
ioral development (cf. Scrimshaw & Beban 
1961), may be of some relevance. The effect? | 
of this deficiency are, however, possibly r 
versible in contrast to PKU. It is of interes 
that Some liver enzyme activities are 1” 
early in dietary protein deficiency (Kerr 
Auerbach, & Linn, 1956). Recently, Cow! 
and Griesel (1959, 1962, 1963 1964) havi 
reported behavioral deficits fn first 3 
second-generation rats raised on low prot® 
diets. 
, Many of the PKU studies to date are P" D 
liminary reports and convention papers WP 
rely upon unpublished data and observatio"; 
In the future it is to be hoped that cael 
extensive and complete reports will appeat : 
that results can be more readily evaluate | 
Many problems in this work are similar , 
those encountered in experimental behav! 
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_— Thus, the investigator attempting to 
Tie? a would do well to be cognizant 
Sex the or k. Lastly, the studies reviewed 
of PKU _ of experimental production 
of the , os important step in our knowledge 
velegmeet ations between biochemistry, de- 
ested o aia, intelligence, as well as a 
entei nge in orientation of research on 
ardation. 
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This review highlights 
of measurement found in many stu 
effectiveness of positivi 


a number of metho 
dies whi 
e social reinforcemen 
s has frequently 


children. The omission of control group: 


founding of reinforcer effects wit 
Problems relating to the use of di 
of reinforcement, the measurement of 
of mediating response strat 
research focus on the possi 
tingency and on the influence 0 


h ei years have seen the publication of 
drena of studies attempting to influence 
reinforce, motor behavior by “positive social 
Subject? ment.” In most of these studies, the 
in or A task was to take marbles from a 
them in 'spenser, one at a time, and to insert 
of hole Ş hole in the apparatus; the number 
nine, E available has ranged from two to 
chiefly ‘ee reinforcement has consisted 
menter, 1 praise statements by the experi- 
mitted n some cases the experimenter also 
Ocus on smiles and nods. This review will 
‘E design apparent deficencies in methodology 
Verbal in studies dealing with the influence 
Ve mot, reinforcement on children’s repeti- 
Scribeq — performance of the type de- 
t Measy ach of the empirical indices used 
conside social-reinforcer effectiveness will 
idered, followed by a discussion of 


e 
Problems of method. 


ar 
of 


PREFERENCE-RATIO MEASURES 


he 
measurement of the presumed effect 


Soci 3 

` ial reinforcement has been expressed in 
Te š 
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Zi Nike, K are indebted to D. M. Baer, L. D. 
n or hel Patterson, H. W. Stevenson, and E. 

Sole. > Pape elpful criticism of an earlier version 
“ly the T, but the opinions herein expressed are 

responsibility of the authors. 


1 
by 
ę] 


h regress! 
fference scores, the e 
developmental changes, and the function 
egies are dis 
ible role of awareness of the reinforcement con- 
f other mediating responses. 
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dological weaknesses and problems 
ch have attempted to evaluate the 
t on repetitive motor behavior in 
resulted in the con- 
jon-to-the-mean or warm-up effects. 
ffect of various schedules 


cussed. Recommendations for future 


terms of a change in ratio of critical over 
total responses before and after introduction 
of the reinforcement condition. The “prefer- 
ence ratio,” introduced by Gewirtz and Baer 
(1958b), represents the number of responses 
to the less-preferred hole (X+) divided by the 
total number of responses to both holes 
(Xo + Yo) during the last minute of a 4- 
minute nonreinforcement base-line period. A 
comparable preference ratio (X,/X; + F+) is 
obtained for the reinforcement period or for 
several intervals within this period. In this 
ratio X, represents the number of responses 
to the hole which had been less preferred dur- 
ing the base-line period, and is thus the hole 
to which responses were reinforced during the 
d. The difference between 


experimental perio ; € 
the values, (X:/Xr + Y,) and (X/X» + J as 
is a preference-change score. In some in- 


stances the score calculated for a subject 
has been a mean or median value of several 
preference-change scores computed by sub- 
tracting 4 single base-line-period ratio from 
the ratio obtained for each of several 
cement intervals. 


consecutive reinfor' 
d Baer preference change 


The Gewirtz an 
score (1958a, 1958b) has also been used by 


Walters and Foote (1962), Walters and Ray 
(1960), and Zigler and Kanzer (1962). Un- 
fortunately this scoring system entails the 
questionable assumption that the base-period 
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preference becomes stable after a short inter- 
val of responding, and can be reliably meas- 
ured by a ratio based on the subject’s fourth 
minute of performance on the task. Since none 
of these studies used nonreinforced control 
groups, the reported effects of social reinforce- 
ment may have been confounded with the 
statistical artifact of regression to the mean, 
with trends within time, or both.? 


Regression to the Mean 


Regression to the mean is found when later 
measures are taken on groups of subjects 
originally constituted on the basis of their 
extreme scores. In the case of preference- 
change scores for the two-hole, marble- 
dropping task, the regression effect could stem 
from the procedure of selecting the subject’s 
less-preferred position (hole) response as the 
one to be reinforced. In a study by Patterson 
and Hinsey (1964), for example, some sub- 
jects exhibited no stable position preference 
during the base period but fluctuated around 
the chance (50%) point between the two 
possible response alternatives. For such sub- 
jects the use of a brief base period for the 
determination of the base-period preference 
score, X,/X, -+Y,, should, even without re- 
inforcement, tend to yield X,/X, + Y r values 
larger (i.e, closer to chance level) than the 
base-period Preference scores, 

Cairns (1962), 
task, has demonstrat 


base period the 
Xy/Xy + Vy + Zi, 
inforced and the randomly reinforced control 
ng the 
group 
nt every time the 
ccurred, while the 
d for every third 
ch knob was being 


least-preferred response o 
control group was praise 
response regardless of whi 


2 Reference to any stud: 
method problem should no: 
the results of the study 
ambiguous, since in some o: 
manipulation of an indep 
confirmation of the empiri 


y in connection with a 
t be taken to imply that 
are necessarily rendered 
f these investigations the 
endent variable Provided 
cal dependent variable. 


used. During the first 10 trials of the E 
forcement period both groups rose to a vale 
preference ratio of approximately .33, a va r 
which corresponds to chance performance M 
would seem probable that the preference É 
increases of .10 (.33-.23), found irrespec a 
of selective reinforcement, were due to regre 
sion to the mean. : tifact 
The importance of this regression ar not 
for the two-hole, marble-dropping task has ine 
been explored. The effect appears to il 
consequential, however, when the pres ; 
ratio is derived from at least 4 amo | 
performance (Patterson & Hinsey, 196 05 
Some investigators have avoided the ie 
sible influence of statistical regression. nple: 
Cullers and Stevenson (1960), for ere 
randomly selected the response alternativi 4 
be reinforced on a task which involved Tne 
ing one of three knobs on each trial. n Te 
report no significant differences bem yea! 
inforcement and control groups of 8-1 l 
old subjects, oidin 
As yet untried is the possibility of av ouble 
the regression artifact by the use of a pas? 
base-line period. The first interval of me pe 
Period could be used solely to a, nd 
subject’s less-preferred response. The § pe 
interval could then be used to der w 
base-period preference ratio, While this gsi0™ 
would be independent of statistical regre er 
a control group should be employed tO ov! 


: : r 
tain whether systematic trends occu 
time. 


Stability of Individual Differences / 


nc? 
Investigators have tried to relate prefe e | 
change scores to measures of child P vio" 
ality or parental child-rearing 3 
(Cairns, 1962; Patterson, Littman, & / 1 el 
1964), but in the absence of substant iit 
tionships with other variables the reli int 
of preference change as an index lis, 
individual differences should be esta ate, 
Patterson and Hinsey (1964) report 500", 
retest correlation for preference-change 7 
of 73 (N= 24) when testing ne 
Separated by 1 week. This value is a soy 
ingly high for the reliability of a chan ie e 
(see Bereiter, 1963) and should be eY r 
in terms of certain procedural details. a 
Statements were dispensed on a 


Ta 
= 
a = 
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schedule for the first 20 less-preferred re- 
sponses and then on a 2:1 ratio schedule 
for the next 40 appropriate responses. This 
Schedule made it possible for the subject 
to develop a correct performance strategy 
Unrelated to the reinforcement contingency. 
By increasing response rate, subjects were 
able to increase the number of reinforce- 
a obtained per unit time without chang- 
ps the proportion of responding to the two 
= Adoption of this strategy, or of an 
tegen strategy, would result in rela- 
Sa: “ small but comparable preference-change 
slong, The abi the first and second ses- 
Sonat 6 reliability coefficient of .75 re- 
Prob: Bi by Patterson et al. (1964) would 
netta] y have been lower had the experi- 
inten „task permitted only one strategy for 
unit pe the number of reinforcements per 
ime. 
ther addition to the point just discussed, 
re a the possibility that subjects who 
Stee an of the reinforcement contingency 
Perio the first session may enter the base 
OF set: pe the second session with expectancies 
Ways S oe their performance in diverse 
Corre] Fi thus appears that the test-retest 
or st ‘on may be an inappropriate statistic 
chan aluating the reliability of preference- 
esti Scores. An alternative approach lies in 
£ mene of the reliability of preference- 
Perfor, Scores from the internal reliability of 
conditin ne” during the first-base and first- 
betwee a8 periods and from the correlation 
Guilford erformance during these two periods 
, 1954, p, 393). 


RATE or Response MEASURES 


b Sometimes social reinforcement is studied 
after E whether the subject speeds up 
Nereas troduction of the reinforcer. Rate- 
'Ypicaty Scores based on this approach are 
trac med difference scores derived by sub- 
from 42 the response rate during a base period 
Petiog € response rate during an experimental 


experimenters implicitly make the 

iW ut questionable assumption that in 
to „While thi 
tei riina behavior certainly represents a response 
top rcer E it is not reflected by the index of 
"sider a ectiveness and, in this context, must be 
“to be an artifact. 


Ome 
“tuciaq 


the absence of reinforcement the response 
rate of child subjects is stable over time. This 
assumption has led to studies lacking non- 
reinforced control groups because each sub- 
ject is thought to be “his own control” 
(Grossman, 1963; Stevenson, 1961; Steven- 
son, Keen, & Knights, 1963; Stevenson & 
Knights, 1962b; Stevenson & Odom, 1962). 
It is unlikely, however, that measures based 
on brief base periods can serve as controls 
for time-related changes such as warm-up 
effects, fatigue effects, and the effects of sub- 
jects’. varying hypotheses and motivational 
states. 

The existence of such time-related changes 
was reported by Zigler (1963). Since this 
study did not incorporate a base period, the 
responses during the first minute of the 
6-minute reinforcement period will be com- 
pared with later performance, for the purpose 
of this exposition. The nonreinforced group 
(N = 64) showed a mean-per-minute rate 
increase identical with that of the reinforced 
group (V = 64)—1.6 responses. While this is 
a relatively small upward drift in response 
rate, its importance can be seen in relation 
to a study by Stevenson and Odom (1962). 
There the overall, mean-per-minute rate in- 
crease for subjects receiving social reinforce- 
ment was less than 1.0. 

Suggestive evidence for the existence of an 
upward response-rate drift can be found in a 
study by Stevenson and Knights (1962a) who 
report an increase in response rate over three 
separate testing sessions. Of tangential inter- 
est is the failure to demonstrate the influence 
of positive social reinforcement when a non- 
reinforcement control group was employed 
(Kelly & Stevens, 1964;* Parton & Ross, 
1964; Stevenson & Fahel, 1961; Zigler, 
1963). It should be noted, however, that 
Andronico (1963) reports a ginena 
higher, lever-pulling rate during a -minute 

iod for a reinforced group 


reinforcement per! 
than for a nonreinforced control group. 


Several factors may contribute to response- 
rate drift. The most fundamental of these is 
probably the warm-up effect which is fre- 
quently encountered during the initial periods 


med by the reviewers using the 


is perfor 
rene Ho ted by Kelly (1962). 


original data presen 
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of motor-task performance; another may have 
to do with the availability of a terminal goal. 
In studies where subjects are provided with a 
supply of marbles to be dropped in a hole 
(e.g., Stevenson & Fahel, 1961) the presence 
of the supply box furnishes a clear terminal 
goal, that of emptying the box. The fact that 
the box may hold an abundant supply of 
marbles (as many as 2,200 in the study by 
Stevenson and Fahel) may have little influ- 
ence on young children who might adopt the 
unrealistic goal of emptying the box and, in 
working toward this goal, change their 
response rate. 

It is possible that many children in psycho- 
logical experiments expect feedback on their 
performance from the adult experimenter. 
When the experimenter remains silent, as he 
does when running a nonreinforced control 
group, a child subject may institute a strategy 
of increasing his rate of response in order 
to assure himself that the experimenter is 
still aware of him, or to elicit from the 
adult a comment which might indicate that 
he is doing satisfactorily or has reached the 
stopping criterion. Should such an effect oper- 
ate with a nonreinforced control group a 
measure based on rate of response would, of 
course, obscure the social-reinforcement effect. 

In addition to the two aforementioned prob- 
lems which beset measures based on the rate 
of response, there is a question about the sta- 
bility of rate-change scores. Patterson and 
Hinsey (1964) had the same female adult 
administer social reinforcement to 24 child 
subjects for two sessions 1 week apart. They 
report ® a test-retest correlation of .19 which 
might be interpreted as reflecting low stabil- 
ity of the rate-increase measure. The authors 
note, however, that during the second session 
the range of rate increase scores was severely 
constricted. If social reinforcement is effective 
in increasing the performance rate of children 
during the first experimental period, their 
high response rate might still be in evidence 
during the base period of the second session. 
The authors conclude that this increment in 
response rate during the base period of the 
second session constricted the range of possi- 


5 In an unpublished, expanded version 


l of Patterson, 
Littman, and Hinsey (1964). 


ble rate-increase scores, reducing the corre } 
lation between the two testing sessions, and 
thus produced a deceptively low index of E 
ability. The suggestion offered above for eva” 
uating the stability of preference changê | 
scores is also relevant for rate-change scores. | 


TIME MEASURES 


Studies using time measures to eaud 
the effectiveness of social reinforcement a | 
all included nonreinforced control group 
thus providing a clear test of the influence | 
the experimental manipulation. waa the 

One such measure is the total time was | 
child spends at the task. This approach vai 
used by Zigler, Hodgden, and Stevensi 
(1958); Zigler and Wiliams (1963); ort 
Shepps and Zigler (1962); all of whom reP les | 
significant reinforcement effects with ipea | 
of retarded children. On the other 1961) | 
Green and Zigler (1962) and Zigler G d 
found no difference between reinforced rf 
nonreinforced groups for samples of retat“ y 
children. Two studies report nonsigni ton 
reinforcement effects with normal chil & 
(Green & Zigler, 1962; Zigler, Hodge 
Stevenson, 1958) while more recently 4’ 
(1963) found a significant effect with oot 
of normal and retarded children. In sumni j 
there appears to be evidence for the effec he 
ness of social reinforcement on time a re 
task with both normal and retarded chil at” 

Some studies have used the number © nce 
bles inserted as a measure of perform? ite 
Such a score is the product of respons? pe 
and the length of time the subject stays “tt 
task. To date there has been one demor nee” 
tion that social reinforcement affects this rus? 
sure of test performance. Stevenson and cel? 
(1961) found a significant overall differen & 
the performance of a reinforced group, A e” 
perimenter-present control group, and Pnet 
perimenter-absent control group. The port 
number of marbles dropped by the re! ip? 
group was more than four times that ° yw 
experimenter-present control group, pr att 
ably a significant difference, Stevens?” 40) 
Snyder (1960) report a mean number ° ri e” 
marbles inserted by an experimentet -P g i 
control group while the reinforced 810™ jo? 
serted a mean of only 126.3 marbles, ĉ 
significant difference. 
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USE or DIFFERENCE SCORES 


When base performance is correlated with 
= e the determinants of base perform- 
Bess oe determinants of change. Correla- 
and oe than —.30 between base rate 
the Pad change have been noted in three of 
(Patton studies reporting this correlation 
Mal & Hinsey, 1964; ‘Stevenson, Dol 
Prefer S Fahel, 1961; Zigler, 1963).° For 

1964) 4 ratios, Patterson and Anderson 
from — ig negative correlations ranging 
analyses 9 to —.80 for several studies. Since 
ine based on difference scores fail to 
effect y the source of variation producing an 
Provid i more appropriate procedure can be 
Period, with analysis of variance. Base- 
entered a experimental-period scores can be 
actor as two subgroups in a within-subjects 
tuting with the experimental groups consti- 
Viner gee meen-subjects factor (e8; se 
action p P. 302 fi.). A significant inter- 

asi: €tween these two factors is equivalent 
Gre ery group effect based on difference 
design a addition, this analysis-of-variance 
rel tee information indicating which 
action he produced the significant inter- 

Y inspection of the cell means.’ 
© aforementioned factorial design ap- 
More appropriate than an analysis-of- 
ance procedure in which experimental 
terion ac Period performance serve as the cri- 

‘he as Concomitant variables, respectively: 
"ate and ively low reliability of base-petlo 
Sey 1 Preference scores (Patterson & Hin- 
cates O45 Stevenson & Knights, 19624) indi- 
adequater, analysis of covariance would fail 
Further Y to correct for initial differences. 
datą More, for the studies under review the 
ari hoe not have met the analysis of co- 
“i Cites ption of linear regression © 

“sures On measures on the concomitant 
pou y (Lindquist, 1953, p. 323). The pre- 
agg NOted negative correlations between 
change performance probably re- 


Dears 
“Ovary 


ar 


o 
pte negative correlations were spuriously 
in boti the same errors of measurement a 
“orre P- 34 Oth sets of scores being correlated. Lor 
lation ) offers a formula which estimates the 
Ppp Tors geen true base rate and true change 
fop The aup, measurement partialed out, 
‘ ors are indebted to L. G. Humphreys 


estin, 
E this mode of analysis. 


flect a ceiling effect (at least for the rate 
measure). This ceiling effect would be re- 
flected in a departure from linearity of regres- 
sion of the criterion measures on the con- 
comitant measures. 

Some investigators have exercised caution 
in using difference scores as an interindividual 
index of social-reinforcement responsiveness. 
In two studies (Patterson & Anderson, 1964; 
Patterson, Littman, & Hinsey, 1964) an 
attempt was made to eliminate the influence 
of magnitude of base-period preference (based 
on an N of 160). If the usual assumption is 
made that error of measurement is not cor- 
related with magnitude of the control variable 
(base-period preference ratios) a problem is 
posed by the fact that those subjects who 
exhibit base-period preference ratios close to 
50 also tend to show small preference-change 
scores (Patterson & Hinsey, 1964). For base 
preferences nearest to .50, subjects’ change 
scores will thus incorporate a greater propor- 
tion of measurement error than will the change 
scores of subjects with more extreme base- 
period preferences. The T transformation, 
however, serves to equate extreme scores at 
each level without regard to this proportional 
difference in error of measurement, thereby 
resulting in unreliable change scores for sub- 
jects at the level nearest .50. Considering only 
the reliability problem and the necessity of a 
large sample for deriving a ee this 

imique cannot be recommen ed 
as appes to be little, if any, justifica- 
tion for the use of difference scores 1n gored 
gating relationships among ery a a 
ence variables. Such analysis fai s to in a 
whether the correlations obtained as me 
tion of base-period relationships, ee aki 

eriod relationships, oF diterna in be 
bility between the two testings. T i prere i 
-ve is to analyze base and experimenta 
es separately; and to investigate 
ae oF ains in performance by comput- 
ee : rrelations between experimental- 
ing Par erformance and each individual dif- 
perio variable, with base-period perform- 
penn as the control 


variable (see Lord, 1963). 

ScHEDULES OF REINFORCEMENT 
A variety of schedules of reinforcement have 
been used in the studies under consideration. 
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While acknowledging that there are practical 
advantages for selecting a specific schedule 
(e.g., controlling the number of reinforcements 
per unit time) there often has been no justi- 
fication for the choice of a particular schedule. 
Whenever interval schedules are used in con- 
junction with a rate of response index of re- 
inforcement effect it is impossible for the sub- 
ject to change the number of reinforcements 
per unit time (Hartup, 1964; Parton & Ross, 
1964; Stevenson, 1961; Stevenson & Fahel, 
1961; Stevenson et al., 1963; Stevenson & 
Knights, 1962a, 1962b). Thus, regardless of 
the subject’s response rate, compliance with 
the experimenter’s initial instructions to per- 
form the task results in a fixed number of re- 
inforcements (due to all subjects operating at 
the limit characteristic of the schedule). The 
use of a schedule which is in effect “non-rate- 
contingent” may entail either the assumption 
that reinforcement has an, as yet unspecified, 
energizing effect, or the belief that the differ- 
ential influences produced by ratio or interval 
schedules with tangible reinforcers (Long, 
Hammack, May, & Campbell, 1958) occur 
upon brief exposure of a subject to a social- 
reinforcement situation. When the dependent 
variable under investigation is response rate it 
would seem more appropriate to use a ratio 
schedule of reinforcement since such a sched- 
ule makes the number of reinforcements per 
unit time contingent on the subject’s response 
rate (Andronico, 1963; Lockwood, 1963; 
Stevenson & Odom, 1962; Zigler, 1963). 
When a particular response class is being 
reinforced, as in the reinforcement of the 
least-preferred response on the two-hole, mar- 
ble-dropping task, even a ratio schedule of 
reinforcement poses problems. This stems 
from the fact that a subject can increase the 
number of praise statements per unit time by 
either increasing the rate of response to both 
holes or changing the proportion of marbles 
dropped into each hole. Since Patterson et al. 
(1964)° found no correlation between rate 
change and preference change it is quite con- 
ceivable that some subjects may respond to 
the reinforcement manipulation with a rate- 
increase strategy while the experimenter is 
measuring only change in preference. It should 
also be considered that the use of a 100% 
fixed-ratio schedule, even for a short period, 
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. J 
may have the effect of reinforcing a subjects 
alternation behavior rather than a position re- 
sponse as intended by the investigator. Som? 
of these difficulties can be avoided if the bi 
hole, marble-dropping situation is viewed ng 
as a simple operant task, but as a two-cholee 
decision-making task in which the subject de 
termines the intertrial interval. If a standa" 
intertrial interval were to be used with i 
two-hole, marble-dropping task it shou 4 
eliminate the subject’s confounding strateg 
of increasing his rate of response and na 
thus provide a more sensitive assessment i 
the influence of social reinforcement. In a a 
tion the use of a fixed intertrial interval © 
reduce the subject’s response rate and tme 
lower the possibility of a pattern of respon g 
being perceived as the contingency for re! 
forcement. 


DEVELOPMENTAL CHANGES 


t 
Careful consideration must be given tO 
constitutes appropriate dependent varia y 
when assessing developmental changes Poul 
sponsiveness to social reinforcement. MC% g 
lers and Stevenson (1960) found that agn 
year old children exhibit stronger por it 
preferences than 8-10 year olds. Two oppe be 
but equally unjustified conclusions mig ip 
drawn from such age-related difference? at 
position preference. One might conclude "1, 
the younger children are more respons}¥° er 
social reinforcement if, due to their strc 
position Preference, they enter the rein ore! 
ment period with smaller base-period, , 
crence-ratio scores, thus raising the ma*! bie 
increase scores possible for them. while vo 
effect would bias the results in favo" jo" 
younger children, the opposite cone yall? 
might result from the employment of 4 spe” 
reinforcement schedule where, because of gee 
stronger position preference, younger CH yo 
would receive fewer reinforcements P° 
time. gin? 
Another stereotyped mode of respo y i 
found in young children is the tender ye 
alternation of response. Craig and wae 
(1963) found that kindergarten childre” yi 
play more alternation behavior than patty 
dren in the fourth and eighth grades- 1 
son and Hinsey (1964) report that 4” ‘jas 


those subjects who alternated during t 


| 


0 
oh 
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period showed no response to the social-re- 
inforcement manipulation. Such age-related 
differences in behavior suggest that preference- 
=e scores as currently used are not suit- 
or work attempting to assess develop- 
Mental changes, 
as he of rate-increase scores for assess- 
social . developmental changes in response to 
ment reinforcement also embraces measure- 
that problems. Stevenson (1961) has reported 
0 are naserntE triples from ages 3 and 4 
hoe 9 and 10. This marked difference in 
i ii Performance suggests that ceiling 
change-sc partially determine the results of 
Score analysis for such data. 


Mepiatinc RESPONSES 


to The heuristic value of a cognitive approach 
(Spiana reinforcement has been established 
reson erger, 1965), and DeNike (1964) has 
with a evidence indicating that, at least 
site fo, lege subjects, awareness is a prerequi- 
ertain such learning. Among those studies 
dren oS to the social reinforcement of chil- 
nder motor behavior, only Patterson and 
of th Son (1964) have investigated the role 
men, SUbiect’s awareness of the reinforce- 
Contingency. Patterson and Anderson 
nonsignificant correlations between 
eryj, Ss ratings based on postexperimental 
intervie’ and preference-change scores. The 
inforcen used to assess awareness of the re- 
ter: Ment contingency was developed by 
Orce o et al. (1964). It was designed to 
Verba; © Subject who did not spontaneously 


Whi alize a strategy to “guess” the hole on 


in reinforcement had been contingent. Us- 
Contin “Point scale for rating awareness of the 
tect Sency, subjects who guessed the incor- 
thoge © Were assigned a score of 1 while 
Score o © guessed the correct hole earned a 
Signeq . (the scores of 2 and 4 were as- 
Verbaling, irrelevant strategies or spontaneous 
ations of correct strategies, respec- 
Weight quais procedure, which arbitrarily 
and € random guesses of some subjects 
tion Jt of others low, confounds the evalu- 
rence, the influence of awareness on pref- 
(1964, Mge scores. Patterson and Hinsey 
tos s have furthermore pointed out that 
“Ddjects who show frequent response 
lons are reinforced for maintaining 


report 
arene 
1 


this alternation behavior and frequently show 
zero preference-change scores. Those subjects 
who alternate and are judged aware would 
thus tend to attenuate the correlation between 
awareness and preference change. 

The two-hole, marble-drop task provides an 
excellent means of investigating the influence 
of both awareness of the correct strategy and 
the incidence and influence of other response 
strategies, such as a speed strategy and an 
alternation strategy. Subjects could be as- 
signed to strategy groups on the basis of a 
postexperimental interview and their experi- 
mental performance then analyzed in terms 
of preference, rate of response, and response 
alternation. In addition, it would be important 
to consider the subject’s verbal report of how 
much he liked receiving the praise words and 
whether he implemented his strategy (ie., 
worked to maximize the amount of reinforce- 


ment). 
CONCLUSIONS 


Precise application of operant methodology 
has proven of value in the study of reinforce- 
ment effects with children (Bijou & Baer, 
1963; Long et al., 1958), but many of the 
studies here reviewed abbreviated operant 
technique by using only one estimation of 
base-line performance and one estimation of 
reinforced performance. If the investigator is 
serious about adopting an operant approach, 
repeated assessments of base-line and rein- 
forced performance should be obtained in or- 
der to show whether a reliable reinforcement 


effect is present. Whatever method orientation 
it appears fruitless to use 


is adopted, however, 1 u 
Snail situations which proni i 
development of response strategies unrela 
to the empirical dependent variable. ae 
There is a need for the ane a mor 
appropriate tasks for assessing the 7 = 
of social reinforcement. The assumption 
tasks must have low intrinsic interest is sub- 
j to question. When intrinsic interest 1s 
i th subject’s continued performance is 
rient a function of the experimenter’s in- 
structions, which suggests that performance is 
«med at obtaining or maintaining the experi- 
eaters approval. To a great extent base-line 
performance may be contingent on the sub- 
ject’s desire for social approval. Under these 
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circumstances the introduction of praise state- 
ments following a long base period may serve 
as a cue to terminate performance, emphasize 
a strategy which was started during the base 
period, or search for the experimenter’s rein- 
forcement contingency in order to terminate a 
boring task. Some of these problems could be 
circumvented by devising a task in which the 
occurrence of either of two instrumental re- 
sponses results in an interesting visual display 
(or tangible reward). Social reinforcement 
could be made contingent on one of these re- 
sponses and a preference measure employed as 
an index of reinforcer effectiveness. 

Among the studies reviewed, even the in- 
vestigations which report significant reinforce- 
ment effects do not reveal marked control 
over behavior. As Bijou and Baer (1963) have 
indicated, this may be due to the same stimu- 
lus serving as a positive reinforcer for one 
child, a neutral event for another, and pos- 
sibly even as a negative reinforcer for a third. 
It may thus be unreasonable to expect that 
social reinforcement will uniformly facilitate 
performance in a random sample of children. 
It must also be acknowledged that there has 
been no demonstration that the stimuli intro- 
duced as positive reinforcers in these studies 
served more than a cue function. Might it not 
be enlightening to introduce a random rein- 
forcement condition along with an adequate 
posttest interview with the aim of determining 
whether preference-change scores and the re- 
port of a least-preferred-hole strategy varied 
across contingent and random- 
conditions? 

Investigation of the events and Processes 
which lead to individual variation in the in- 
centive value of social reinforcement for chil- 
dren, while a difficult venture, holds prospects 
of illuminating much of the socialization 
process. Such endeavors deserve and demand 
a sound methodological base. 


reinforcement 
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N=1 


WILLIAM F. DUKES 


University of Calijornia, Davis 


Studies 


focused on the behavior of only one individual are examined. Despite 


the limitation on generalizing inherent in such studies, selected examples attest 
to their importance in the history of psychology. Their frequency in modern 
psychology is noted, together with their distribution across major topics. The 
usefulness of an N of 1 in research is viewed as extending beyond the single- 
case studies of clinician and personologists. An N of 1 is seen as also appro- 
priate when, for the function considered, intersubject variability is low, when 
opportunities for observing a given class of events are limited, and when a sup- 
posed universal relationship is questioned and the obtained evidence is negative. 


In the search for principles which govern 
behavior, psychologists generally confine their 
empirical observations to a relatively small 
sample of a defined population, using proba- 
bility theory to help assess the generality of 
the findings obtained. Because this inductive 
process commonly entails some knowledge of 
individual differences in the behavior in- 
volved, studies employing only one subject 
(N = 1) seem somewhat anomalous. With no 
information about intersubject variability in 
performance, the general applicability of find- 
ings is indeterminate. 

Although generalizations about behavior 
rest equally upon adequate sampling of both 
subjects and situations, questions about 
sampling most often refer to subjects. Ac- 
cordingly, the term “N = 1” is used through- 
out the present discussion to designate the 
reductio ad absurdum in the sampling of 
subjects. It might, however, equally well 
(perhaps better, in terms of frequency of 
occurrence) refer to the limiting case in the 
sampling of situations—for example, the use 
of one maze in an investigation of learning, 
or a simple tapping task in a study of motiva- 
tion. With respect to the two samplings, 
Brunswik (1956), foremost champion of 
the representative design of experiments, 
speculated: 


In fact, proper sampling of situations and problems 
may in the end be more important than proper 
sampling of subjects, considering the fact that indi- 
viduals are probably on the whole much more alike 
than are situations among one another [p. 39]. 


As a corollary, the term N = 1 might also 
be appropriately applied to the sampling of 
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experimenters. Long recognized as a potenti 
source of variance in interview data (ef 
Cantril, 1944; Katz, 1942), the investis? 
has recently been viewed as a variable W a 3 
may also influence laboratory results (e87 
McGuigan, 1963; Rosenthal, 1963). 
Except to note these other possible usse 
of the term N = 1, the present paper s 
concerned with one-experimenter OY a 
situation treatments, but is devoted, 45 
cated previously, to single-subject studies: 


firs! 
Despite the limitation stated in the g 


ea SS 
paragraph, N = 1 studies cannot be dis ae 
as inconsequential. A brief scanning of 8°" iI 


and historical accounts of psychology pots 
dispel any doubts about their import ott! 
revealing, as it does, many instances of P’ com 
research in which the observations Wer < of 


fined to the behavior of only one pers? 
animal. 


SELECTIVE HistoricAL REVIEW 


in 

Foremost among N = 1 studies is Evje 
haus’ (1885) investigation of memory: < pis 
by some authorities “a landmark in Pê gil 
tory of psychology . , , a model whl? 951 
repay careful study [McGeoch & Irio®: 
p. 1],” considered by others “a reme y pett 
at least as bad as the disease Bash 
1932, p. 3],” Ebbinghaus’ work estab o' 
the pattern for much of the resea!® 
verbal learning during the past 80 ye 
principal findings, gleaned from many, | 0 
administered learning situations consist gy 
some 2,000 lists of nonsense syllab es a 
42 stanzas of poetry, are still valid K a 
material for the student of memory: 
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other well-known pioneering study of learn- 
ing, Bryan and Harter’s (1899) report on 
aaa, certain crucial data were obtained 
tom only one subject. Their letter-word- 
es analysis of learning to receive code 
a ad on the record of only one student. 
i notion of habit hierarchies derived 
E this analysis is, nevertheless, 
Sor: ul in explaining why plateaus may 
Familiar even to beginning students of 
r piin is Stratton’s (1897) account of 
ao from and the adjustment to 
ee inverting lenses. In this experiment 
onl ing to Boring (1942), Stratton, with 
y himself as subject, 


Si 
mie both Kepler's problem of erect vision with 
een Image, and Lotze’s problem of the role 
the « aa es in space perception, by showing that 
absolute” localization of retinal positions—up- 
and right-left—are learned and consist of 
orientation as context to the place of visual 
on [p. 237]. 


down 
bodily 
excitati 


wae tole of experience was also under 
raising in the Kelloggs’ (1933) project of 
home, one young chimpanzee, Gua, 1m their 
chavi (Although observations of their son $ 
the ma were also included in their report, 
since Peda essentially of the N =1 type, 
One.) no, experimental group” consisted of 

his attempt to determine whether 
tion Petience may modify behavior tradi- 
a sta Y regarded as instinctive was for years 
learnin ee reference in discussions of the 

Poca; a uration question. ; : 

alloon. in the area of motivation is the 
gists p Vallowing experiment of physiolo- 
Whic anon and Washburn (1912) in 
Stomac Ymographic recordings of Washburn S 
mith hi contractions were shown to coincide 
Their S introspective reports of hunger pangs- 
Psycho ndings were widely incorporated into 
Nation By textbooks as providing an expla- 
Bteater į hunger. Even though in recent years 
al f Importance has been attached to cen- 
ctors in hunger, Cannon and Wash- 
Dlac wak continues to occupy a prominent 
behavior textbook accounts of food-seeking 


aoe 

t m 

wd he literature on emotion, Watson 

Ring QYner’s study (1920) of Alberts 
Onditioned to fear a white rat has 


been hailed as “one of the most influential 
papers in the history of American psychol- 
ogy” (Miller, 1960, p. 690). Their experiment, 
Murphy (1949) observes, 


immediately had a profound efect on American 
psychology; for it appeared to support the whole 
conception that not only simple motor habits, but 
important, enduring traits of personality, such as 
emotional tendencies, may in fact be ‘built into’ 
the child by conditioning [p. 261]. 


Actually the Albert experiment was unfinished 
because he moved away from the laboratory 
area before the question of fear removal could 
be explored. But Jones (1924) provided the 
natural sequel in Peter, a child who, through 
a process of active reconditioning, overcame 
a nonlaboratory-produced fear of white furry 
objects. 

In abnormal psychology few cases have 
attracted as much attention as Prince’s 
(1905) Miss Beauchamp, for years the model 
case in accounts of multiple personality. An 
excerpt from the Beauchamp case was re- 
cently included, along with selections from 
Wundt, James, Pavlov, Watson, and others, 
in a volume of 36 classics in psychology 
(Shipley, 1961). Perhaps less familiar to the 
general student but more significant in the 
history of psychology is Breuer’s case (Breuer 
& Freud, 1895) of Anna O., the analysis of 
which is credited with containing “the kernel 
of a new system of treatment, and indeed a 
new system of psychology [Murphy, 1949, p: 
307].” In the process of examining Anna’s 
hysterical symptoms, the occasions for their 
appearance, and their origin, Breuer claimed 
that with the aid of hypnosis these symptoms 
were “talked away.” Breuer’s young oe 
was Sigmund Freud (1910), who later pare 
declared the pon of this case in the 

ic of psychoanalysis. 
aes E other instances, maybe not so 
he preceding, of influential 
spectacular as the p Yerkes’ (1927) 
N=1 studies—for example, er : al 
exploration of the gorilla ae ae R 
activities; Jacobson s (1931) Stuay 
omuscular activity and thinking in an 
ins tee; Culler and Mettler’s (1934) demon- 
soe n "ot simple conditioning in a de- 
a ake dog; and Burtt’s (1932) striking 
‘itustration of his son’s residual memory of 


early childhood. 
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TABLE 1 


Torat DISTRIBUTION or N = 1 Srupies (1939-1963) 


Category Í 


Examples 


Maturation, 
development 


29 


S 3 ól); 
Sequential development of prehension in a macaque (Jensen, 1968; 
smiling in a human infant (Salzen, 1963) 


i f r ading 18 
Differential reinforcement effects of true, esophagal, and sham feeding 


a dog (Hull, Livingston, Rouse, & Barker, 1951) 


Motivation 7 
Emotion 12 
Perception, 25 


sensory processes 


Anxiety levels associated with bombing (Glavis, 1946) 


i zipni, 
Congenital insensitivity to pain in a 19-year-old girl (Cohen, Kip 
Kunkle, & Kubzansky, 1955); f 


>i stabilize! 
figural aftereffects with a stabi 


retinal image (Krauskopf, 1960) 


Learning 27 
Thinking, language 15 
Intelligence 14 
Personality 51 


Mental health, 66 Multiple personality ( 
psychotherapy 
Total 246 


Delayed recall after 50 years 
(Hayes & Hayes, 1952) 


Keats’ personality from his 
adult of P and R techniqu 


s ye n “hi mpanze? 
(Smith, 1963); imitation in a chimp 


i vech in* 
“Tdealess” behavior in a chimpanzee (Razran, 1961); opposite speech 
schizophrenic patient (Lafial & Ameen, 1959) 
Well-adjusted congenital hydrocer 
intelligence after lobectomy in an 


1, 1947 


alic with IQ of 113 (Teska 
epileptic (Hebb, 1939) 


in oP 
m na 

poetry (McCurdy, 1944); comparison ™ 

es (Cattell & Cross, 1952) 


ice #8 
Thigpen & Cleckley, 1954); massed practi 


therapy for patient with tics (Yates, 1958) 


Further documentation of the signiñcant 
role of N = 1 research in psychological his- 
tory seems unnecessary. A few studies, each 
in impact like the single pebble which starts 
an avalanche, have been the impetus for 
major developments in research and theory. 
Others, more like missing pieces from nearly 
finished jigsaw puzzles, have provided timely 
data on various controversies. 

This historical recounting of “successful” 
cases is, of course, not an exhortation for 
restricted subject samplings, nor does it 
imply that their greatness is independent of 
subsequent related work. 


FREQUENCY AND RANGE oF Topics 


In spite of the dated character of the cita- 
tions—the latest being 1934—. = 1 studies 
cannot be declared the product of an era 
unsophisticated in sampling Statistics, too 
infrequent in recent psychology to merit 
attention. During the past 25 years (1939— 
1963) a total of 246 N = 1 studies, 35 of 
them in the last 5-year period, have appeared 
in the following psychological periodicals: 
the American Journal of Psychology, Journal 


= 
„mal 
oj Genetic Psychology, Journal of Abnorn 
and Social Psychology, Journal of Educat? d 
Psychology, Journal of Comparative eri 
Physiological Psychology, Journal of PAD 
mental Psychology, Journal of Applied log)? 
chology, Journal of General Psycho er 
Journal of Social Psychology, Journal of est 
sonality, and Journal of Psychology. 
are the journals, used by Bruner and 
(1940) in their survey of 50 years of © ni 
in American Psychology, selected as yes of 
cant for and devoted to the advancemet if 
Psychology as science, (Also used P oy 
Survey were the Psychological Reviews "ox 
chological Bulletin, and Psychometrikt sjy 
cluded here because th ee 
publish original empiri oe es" 
these 246 studies consti l Pje 
cent of the 1939-1963 
absolute number is noteworthy and is ee 
enough to discount any notion that / 
studies are a phenomenon of the past. ted 
When, furthermore, these are distrib" teh 
as in Table 1, according to subject a pe 
they are seen to coextend fairly well wit? ss 
range of topics in general psychology 


lipo 
hane 


ey do not oF 
cal work.) A 
tute only a sma 


journal articles: ple 


Aiii TT 


might be expected, a large proportion of them 
fall into the clinical and personality areas. 
One cannot, however, explain away V = 1 
studies as case histories contributed by cli- 
nicians and personologists occupied less with 
establishing generalizations than with ex- 
Ploring the uniqueness of an individual and 
understanding his total personality. Only 
about 30% (74) are primarily oriented 
toward the individual, a figure which includes 
not only works in the “understanding” tradi- 
tion, but also those treating the individual 
R a universe of responses and applying tradi- 
‘onally nomothetic techniques to describe and 
raiet individual behavior (e.g, Cattell & 
ross, 1952; Yates, 1958). 
an actual practice, of course, the two orien- 
ee toward uniqueness Or generality— 
ee a matter of degree than of mutual 
ioe with the result that in the litera- 
Wier purely idiographic research is 
Bie rare, Representative of that ap- 
of ra are Evans’ (1950) novel-like account 
Spt iller who “spontaneously” recovered his 
Sat after more than 2 years of blindness, 
Osen’s, (1949) “George X: A self-analysis 
Dr an avowed fascist,” and McCurdy’s (1944) 
ofile of Keats, 


RATIONALE ror N = 1 


ant appropriateness of restricting an idio- 
from i study to one individual is obvious 
İS ny, he meaning of the term. If uniqueness 
Bone a sample of one exhausts the 
is a es At the other extreme, an N of 1 
Reneralip nro priate if complete population 
Sumeq "ty exists (or can reasonably be as 
indiyj to exist). That is, when between- 
Scrutj ual variability for the function under 
fron, UY is known to be negligible or the data 
Sit Single subject have a point-for-point 
e nee with those obtained from depend- 
jec ateral sources, results from a secon 
= t may be considered redundant. Some 
tion, _ Studies may be regarded as approxima- 
this ideal case, as for example, Heine- 
(1961) photographic measurement of 
images and Bartley and Seibel’s 
i Sully of entoptic stray light, using 
Vener method. 
When ariant on this typicality theme occurs 
e researcher, in order to preserve 


TETE 


some kind of functional unity and perhaps to 
dramatize a point, reports in depth one case 
which exemplifies many. Thus Eisen’s (1962) 
description of the effects of early sensory 
deprivation is an account of one quondam 
hard-of-hearing child, and Bettelheim’s 
(1949) paper on rehabilitation a chronicle of 
one seriously delinquent child. 

In other studies an V of 1 is adequate 
because of the dissonant character of the 
findings. In contrast to its limited usefulness 
in establishing generalizations from “positive” 
evidence, an V of 1 when the evidence is 
“negative,” is as useful as an V of 1,000 in 
rejecting an asserted or assumed universal re- 
lationship. Thus Krauskopf’s (1960) demon- 
stration with one stopped-image subject 
eliminates motion of the retinal image as 
necessary for figural aftereffects: and Len- 
neberg’s (1962) case of an 8-year-old boy 
who lacked the motor skills necessary for 
speaking but who could understand language 
makes it “clear that hearing oneself babble 
is not a necessary factor in the acquisition 
of understanding . . . [p. 422].” Similarly 
Teska’s (1947) case of a congenital hydro- 
cephalic, 63 years old, with an IQ of 113, 
is sufficient evidence to discount the notion 
that prolonged congenital hydrocephaly re- 
sults in some degree of feeblemindedness. 

While scientists are in the long run more 
likely to be interested in knowing what is 
than what is not and more concerned with 
how many exist or in what proportion they 
exist than with the fact that at least one 
exists, one negative case can make it ne- 
cessary to revise a traditionally accepted 
hypothesis. Sdi , n 

i Still other V = 1 investigations simply re- 
flect a limited opportunity to observe. W hen 
the search for lawfulness 1S extended to in- 

«nonlaboratory” behavior, individu- 
frequent not i tudy may be 

Js in the population under study may 
al . acai distributed spatially or temporally 
of the ‘psychologist can observe ea on 

f which may be uselu' as 


report 0 
ae fa mulative record. Examples of 
par 


this include cases of multiple personality 


ilateral color 

i & Cleckly 1954), unile 
(Thigpen (Graham, Sperling, Hsia, & Coul- 
on 1961) congenital insensitivity to pain 
(Coben et al., 1955), and mental deterio- 
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ration following carbon monoxide Poisoning 
(Jensen, 1950). Situational complexity as well 
as subject sparsity may limit the opportunity 
to observe. When the situation is greatly 
extended in time, requires expensive or 
specialized training for the subject, or entails 
intricate and difficult to administer controls, 
the investigator may, aware of their explora- 
tory character, restrict his observations to 
one subject. Projects involving home-raising 
a chimpanzee (Hayes & Hayes, 1952) or 
testing after 16 years for retention of ma- 
terial presented during infancy ( Burtt, 1941) 
would seem to illustrate this use of an N of 1. 

Not all V = 1 studies can be conveniently 
fitted into this rubric; nor is this necessary. 
Instead of being oriented either toward the 
person (uniqueness) or toward a global theory 
(universality), researchers may sometimes 
simply focus on a problem. Problem-centered 
research on only one subject may, by clarify- 
ing questions, defining variables, and indi- 
cating approaches, make substantial con- 
tributions to the study of behavior. Besides 
answering a specific question, it may (Ebbing- 
haus’ work, 1885, being a classic example) 
provide important groundwork for the theo 
rists. 

Regardless of rationale and despite obvious 
limitations, the usefulness of V = 1 studies in 
psychological research seems, from the pre- 
ceding historical and methodological consider- 
ations, to be fairly well established. 


(See 
Shapiro, 1961, for an affirmation of the value 
of single-case investigations in fundamental 


clinical psychological research.) 
status in research is further se 
statistician’s assertion 
that: 


Finally, their 
cured by the 
(McNemar, 1940) 


The statistician who fails to see that important gen- 
eralizations from research on a single case can ever 
be acceptable is on a par with the experimentalist 
who fails to appreciate the fact that some problems 


can never be solved without resort to numbers 
[p. 361]. 
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COMMENTS ON 


“NEED STATES AND 


ACTIVITY LEVEL”: 
A CRITIQUE 


L. GLENN COLLINS 


Mental Health Research Institute, Fort Steilacoom, Washington 


A note on the paper by Baumeister, Hawkins, and Cromwell (1964) to point 
out the risk inherent in relying on secondary sources. These reviewers stated 
that copulation by the male rat does not result in decreased activity levels 
and cited Morgan and Stellar (1950) as support for this statement. The rele- 
vant paragraph in Morgan and Stellar refers to an experiment by Stone and 
Barker (1934) which does not support the reviewers’ statement. 


An important point in Baumeister, Hawkins, 
and Cromwell’s recent paper (1964) is that 
exponents of reinforcement theories find it 
difficult to explain the finding that copulation 
does not serve to reduce the “running wheel” 
activity of the male rat. They gave as a 
reference Morgan and Stellar (1950). Lack- 
ing a page reference but being familiar with 
that book, this reviewer was able to find the 


section to which (it was assumed) the 
reviewers were referring. 


The male rat does not show the periodicity charac- 
teristic of the activity of the female rat but main- 
tains, rather, a relatively constant level of activity. 
There is, moreover, little or no relation between the 
general activity of a male rat and the strength of 
its sexual drive as measured by the readiness with 
which it will avail itself of a receptive female (Stone 
and Barker). In this respect, therefore, there is no 
important relation between activity and strength of 
drive. 

Of interest also is the fact that reduction in sexual 
drive has no visible effect on acivity, for animals 
allowed a period with a female rat in “heat” are 
just as active in a revolving drum as before [Morgan 
& Stellar, 1950, p. 371]. 


80 


i 
cers part 
Referring to Stone and Barker’s P® 


(1934), nothing was found in the be 
tables to support the statement by ser 
and Stellar that copulation does nO sta 

to reduce the activity of the male. This Y 
ment may or may not be valid but me i 0 
ence cited fails to support it. This ae av 
be another instance in which reviewen jabl 
unfortunately depended upon an Hetrat 
secondary source and unwittingly Pe™P 

a misinterpretation. 
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of nocturnal dreaming has bi 


The study 
together and rı 


studies. This article brings 
an attempt was made to activ 
of methods, aims, and results, i 
manipulated, and thus it may 
approach in which dreams become more 
behavior, rather than only memories. 


oa research is carried out with two 
ional Sey sof procedures. One is the observa- 
Serye ae approach, where we ob- 
ake ser hat happens naturally” and try to 
dlations by of it by finding relations or cor- 
‘ervations tae various aspects of our ob- 
Of the ord : his approach leads to statements 
teDort re er, “subjects classified as Sensitizers 
T dreaming significantly more 
on the or n those classified as Repressors 
Oach į MPI (Tart, 1962). The other ap- 
tively S the functional method, where we 
and done pide one factor or variable 
- Thi ve the effect of some dependent vari- 
Order, « S approach leads to statements of the 
amount A large dose of alcohol reduces the 
fereg T EEG stage 1-REM time and in- 
(Gresha reaming in normal Subjects - - P 
a en, Webb, & Williams, 1963). Both 
àS they es are used in most fields of science, 
: complementary. 
be $ hee the nature of dreaming has 
h (is large, a matter of correlational 
s Patil for the moment the last 
search using the electroencephalo- 
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‘esearch, Stanford University 


een mainly a matter of correlational 


eviews those scattered studies in which 
zely influence dreaming. Despite a great diversity 
t is apparent that dreaming can be experimentally 
be possible to develop a functional experimental 


immediate and observable aspects of 


gram, EEG, and rapid eye movement, REM, 
techniques). Researchers asked subjects how 
often they dreamed in color, whether their 
dreams were mainly pleasant or unpleasant, 
and the like, and then attempted to correlate 
these observed characteristics with personality 
characteristics, intelligence, sex, age, etc. Or 
the patient brought in a spontaneous dream, 


and the therapist-researcher then attempted to 


“interpret” it in the light of what he knew 


about the patient. 

We have, of course, learned a great deal 
about the nature of dreaming as a result of 
these correlational studies, and undoubtedly 
we will continue to learn from them. When we 
can complement correlational studies with 
functional studies, however, we shall ad- 
vance our understanding of dreaming much 


more rapidly. A 
is to 


The purpose of the present paper 15, 
review and bring together for the first time 
hich an experimental at- 


those studies in w 
influence either the content or the 
yrnal dreaming was made, in 


what possibilities they sug- 
gest for functional research on dreaming. 

~The past decade has seen an enormous in- 
crease in our knowledge of sleep and dreaming 
resulting from application of EEG measures, 
and a brief summary of this body of results 
will be presented in order to provide an up-to- 
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date background for assessing the studies to 
be reviewed herein. More detailed reviews 
and evaluations of the more than 75 studies 
contributing to the past decade’s new knowl- 
edge of dreaming have been published else- 
where (Kamiya, 1961; Kleitman, 1960, 1963; 
Oswald, 1962, 1964; Snyder, 1963). 

Sleep will be defined in this paper as a 
state of the organism indicated (in human 
subjects) by one of the four EEG stages 
(Dement & Kleitman, 1957a, 1957b). The 
Stage 1 pattern consists of an irregular mix- 
ture of theta waves (4-8 cps), alphoid activity 
(waves of 1 to 2 cps slower than the subject’s 
waking alpha), and occasional alpha waves 
(8-13 cps). Stage 2 contains spindle activity 
(14 cps) in addition to the above, and 
Stages 3 and 4 contain increasingly larger 
proportion (up to 100%) of delta waves (1-3 
cps, high amplitude) in addition to spindle 
activity. The exact division between Stages 2, 
3, and 4 are arbitrary, based on the percentage 
of delta waves in given epochs. The Stage 1 
pattern is readily distinguishable from the 
other stages by its total lack of spindles and 
delta waves. 

Stages 1 through 4 were initially conceived 
of as comprising a continuum from “light” to 
“deep” sleep (Aserinsky & Kleitman, 1953, 
1955; Dement, 1955), but as other measures 
of the “depth” of sleep contradict this con- 
ception (Berger, 1961; Hawkins, Puryear, 
Wallace, Deal, & Thomas, 1962; Kamiya, 
1961; Snyder, 1963; Williams, Tepas, & Mor- 
lock, 1962), this paper will treat sleep as 
being of two qualitatively distinct types, 
namely Stage 1 as one type and Stages 2, 3, 
and 4 as the other type. Distinctions between 
Stages 2, 3, and 4 will not be made in this 
paper, and they will be collectively referred to 
as Nonstage 1 sleep. 

If subjects are awakened from the two 
types of sleep and asked to report what they 
have been experiencing, the reports may be 
classified into two rather distinct types. One 
type, awakenings from Stage 1 sleep or shortly 
(within, roughly, 10-15 minutes) after Stage 
1 sleep has changed to Nonstage 1 sleep, 
possesses the characteristics traditionally as- 
sociated with the experience of dreaming 
(Foulkes, 1962; Rechtschaffen, Verdone, & 

Wheaton, 1963). Reports from Nonstage 1 
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sleep seem more like “thinking,” and ag 
generally called thinking by the subjects 
these same subjects generally refer to k. 
Stage 1 experiences as dreams. The psycholo E 
ical differences reported so far are quant 
tive, rather than being completely dicho 
mous, but generally give the impression 
distinct types of experiences. nied 
Stage 1 sleep is almost always accomp ve- 
by binocularly synchronous rapid eye mea 
ments (REMs), and the evidence is very “y 
vincing that these are closely associated ar 
the content of the dream, if not actual oe 
ning movements of the dream imagery (ee i 
& Oswald, 1962a; Dement & Wolpert, ; 1) 
Roffwarg, Dement, Muzio, & Fisher, 
Such REMs have not been reported in 
stage 1 sleep, although there are some 
rolling movements (Kamiya, 1961). ing 
In view of these findings, the jollon a 
theoretical position will be taken in this r me 
as an aid in surveying the literature 0" eri 
tional studies of dreaming: that an exp 
entially distinct type of phenomenon at 
concurrently with the presence of rag 
sleep, which phenomenon will be called aen 
1 dreaming, or just dreaming. The om be 
phenomena of Nonstage 1 sleep will g 
considered in this paper. Further, i ail 
sumed that the experience of Stage 1 vo ce 
is essentially continuous ? during the Pe cah 
of Stage 1 EEG, whether the subje pot 
always recall this experience on waking ipio 
This position is, in the author’s op gei 
supported by all the studies using nr by 
and REM technique, and directly refu 
none. „eam; 
The term “process” of Stage 1 anny, 
is used to indicate the ongoing psy© T ‘4 
ological activity which is observed 0” ee 
ological level as the Stage 1 EEG patte nce pi 
on an experimental level as the expert i jet 
dreaming (observed directly by th? = bi 
and indirectly via the dream raport g 
terms “dream content,” “reported dre? 66 


slow 


a, 


2? Within a continuous period of Stage 
the content of the experienced dream nats 1 
divided into several distinct episodes 5°, 
sense, there are several distinct “dreams 
mo 
2 dy 
topic may be accompanied by a gross oe 
ment on the part of the subject. 
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simply “dream” refer to the particulars of 
the experience. 
are subjects, Stage 1 dreaming and 
fashion s i sleep alternate in a regular cyclic 
Asa ibe erred to as the sleep-dream cycle. 
tief (a falls asleep there is generally a 
period of y seconds to a minute or two) 
jects’ rep tage l, without REMs, but sub- 
a petiod me indicate that this is apparently 
typical Aes hypnagogic imagery, rather than 
Oswald an (Dement & Kleitman, 19578; 
interval: tak At approximately 90-minute 
tage 1 pi rough the night there are periods of 
eing lo reaming, each dream period generally 
age aer than the preceding one. The first 
ourth Rao may last for 10 minutes; the 
Minutes Fin one may last as long as 
Cupies bet Itogether, Stage 1 dreaming 0C- 
ime of ween 20 and 30% of the total sleep 
9 six ? em young adults, spread over three 
centage wee 1 periods. While the exact per- 
Cles Saria dream time and the number of cy- 
Subject m Ton subject to subject, for a given 
Wite gt e sleep-dream cycle is generally 
1 oe from night to night (Dement, 
Wee ee, 1957b; Kleitman, 
Wi ae 
Sime te in mind of Stage 1 dream- 
Sleep, let g regularly through every night 
Ve att us now review those studies which 
these st €mpted to affect dreaming. Many of 
Very ies were carried out before the dis- 
Precision he sleep-dream cycle and s0 lack 
e st ut do offer some data. 
venije udies will be reviewed under three 
A , ee but not mutually exclusive head- 
at ely: (a) variables affecting the con- 


Proce opie (b) variables affecting the 
a sleep, ne and (c) discrimination 


ian) 
e , 
S Affecting the Content of Dreaming 


atiah 
Yes coles affecting dreaming are of We 


‘n 
Me sleep, rar those applied before the onset 
dig? ad q those applied concurrently with 
T i Mg: The latter type will be 
atu ; 
ga í =a occurring stimuli, both external 
Con, PS falli (e.g., thunder, breezes, stomac 
arg ated ing out of bed), are sometimes in- 
€ man, to the content of dreams. There 
anecdotal accounts of this 1 


bh 


the older literature (Ellis, 1911; Lhermitte, 
1931; Schatzman, 1925; Spitta, 1882; Sully, 
1905; Thompson, 1914; Walsh, 1920), some 
of which are cited by Freud (1954; i.e., works 
by Hennings, Hoffbauer, Jessen, and Meier). 
Such accounts tell us little about the effec- 
tiveness of stimuli in modifying dream con- 
tent, however, as they provide no baseline of 
how many naturally occurring stimuli were 
not incorporated, nor do they provide any 
systematic data on the manner in which the 
incorporated stimuli were represented in the 
dream. 

De Sanctis and Neyroz (1902), Max 
(1935), and Vold (1912) experimentally pre- 
sented stimuli to sleeping subjects, as did 
Claviere, Hildebrandt, Maury, Stepanow, 
Straecke, Vaschide (cited by Ramsey, 1953), 
and Menuier (cited by Ellis, 1911). They 
also demonstrated that stimuli are some- 
times incorporated into dreams. Despite great 
variation in method from study to study, 
there seems to be a consensus in these stud- 
ies that tactual stimuli are more effective 
in affecting dream content than stimuli in 
other sensory modalities. 

Both the anecdotal and early experimental 
were, in the light of what is now 
the sleep-dream cycle, beset with 

of precision, as data con- 
sleep and Nonstage 1 sleep 
ately mixed together. A recent 
study (Dement & Wolpert, 1958), using the 
EEG and REM technique, has greatly in- 
knowledge here. Briefly, a pure 
in 99% of the trials, a 
ht in 23%, and drops of water in 
actual stimulus, this 
older studies. When 
ed during Nonstage 
dreaming was not initiated, 
d idea that dreaming was 
and when the subjects 


studies 
known about 
an inherent lack 
cerning Stage 1 
were indiscrimin 


A Tat 
42%. AS the water 1s 2 
is in agree 


e names we 


j had important, emotional rela- 
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tionships, while others were neutral in this 
respect. Incorporation was objectively judged 
by having both the subjects and an indepen- 
dent judge later try to match reported dream 
content with stimulus names. The stimulus 
names were matched with the correct dream 
reports with considerably better-than-chance 
accuracy by both the subjects and the in- 
dependent judge. The emotionally meaningful 
and the neutral names were incorporated with 
equal frequency. Incorporation was not as- 
sociated with detectable EEG changes, or 
GSR responses. Berger reported the incorpora- 
tions as falling into several descriptive cate- 
gories, in order of decreasing frequency, as 
follows: (@) assonance, where a dream ele- 
ment or word sounded like the stimulus name, 
a sort of “clang association”; (b) direct 
incorporation, where the dreamer reported 
hearing the stimulus name in the dream; (c) 
associational incorporation, where a relatively 
direct personal association to the stimulus 
name appeared in the dream; and (d) rep- 
resentational association, where the stimulus 
name was represented or symbolized by an 
object or action in the dream. 

There have been some studies of the in- 
corporation of stimuli into hypnotic dreams 
(Benussi, 1927; Klein, 1930), but as there 
is strong evidence that hypnotic dreams may 
be quite different from Stage 1 dreams (Bar- 
ber, 1962; Tart, 1964b, 1965) they will not 
be reviewed here, 

Some experimenters applied various stimuli 
to their subjects before the latter went to sleep 
(Cubberly, 1923; Fisher, 1954, 1960; Mala- 
mud & Lindner, 1931; Monroe, 1899; Poetzl 
cited by Ramsey, 1953; Renshaw, Miller, & 
Marquis, 1933; Vold, 1912) and found that 
the stimuli sometimes appeared in the content 
of dreams reported the following day, often 
in “distorted” or “disguised” form. Such 
studies are difficult to evaluate, as there is a 
contaminating factor of waking suggestion 
to dream about the stimuli implicit in the 
design of stimulating subjects and then asking 
them to record their dreams. More serious 
difficulties in evaluating these studies arise 
from the heavy demand characteristics (Orne, 
1959, 1962) undoubtedly operative as a re- 
sult of the psychoanalytic orientation of many 

of the experimenters. About the most that 


can be concluded from these studies is that 
subjects will bring in “dreams” which seem 
to incorporate the stimulus material in dis 
guised or distorted form, but this fact dots 
not necessarily prove the validity of psyche 
analytic theories of dream formation. The 
studies are also subject to other criticism 
discussed below. 

A number of studies apparently de 
strated that posthypnotic suggestion C0" 
control the content of nocturnal dreaming i 
a high degree (Fisher, 1953; Nachmans0™’ 
1951; Newman, Katz, & Rubenstein, 16 
Schrétter, 1951), but as they did not use Jle! | 
monitoring of sleep, they were not contro te 
for the possibility that the dreams aie 
were hypnotic dreams interspersed with sle t 
rather than ordinary nocturnal dreams. °, 
such an event can happen, without the iy 
being aware of it, has been shown 12 o 
studies (Barber, 1962; Schiff, Bunney», 
Freedman, 1961; Tart, 1964b). These s 
also are ambiguous in that effects on pre 
dreaming and Nonstage 1 mental ach of 
would be indiscriminately mixed, and moe de 
the early Stage 1 dreaming of the night ub 
undoubtedly forgotten by the time the ° 
Jects awoke in the morning. pe 

Stoyva (1961), using the EEG and on 
technique, was the first to demonstrate eal” 
vincingly that the content of Stage ! d 
ing could be influenced and manipulat? py 
a very high degree in some subjects bY it of 
of posthypnotic suggestion. The conte 
Nonstage 1 mental activity was also & jn 
although not as markedly, Tart (19640 M 
dependently confirmed Stoyva’s findin® of 
with Stoyva’s subjects, the Stage 1 drea? i 
some subjects were not affected at all, yall 
those of other subjects were almost * sste 
controlled by the posthypnotically suse jid 
content, Stoyva also found that oy * 
waking suggestion affected dream conten 33) 
had been reported earlier by Fisher a 
and Titchener (1895), but not as str 
posthypnotic suggestion. : 

Wood (1962) found that social Í n 
for a period of 1 day affected the cont? g 
Stage 1 dreaming occurring that nig ra w 
subjects seemed to be compensating "%7 y 
isolation by increased “social intercoU™ 
their dreams, 
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ee laboratory situation itself seems to 
ciall ae effect On Stage 1 dreaming, espe- 
ibi waen psychiatric patients are used as 
die (Whitman, Pierce, Maas, & Bald- 
a e A 962). Feelings about the experimenter 
the re out being experimented on appeared in 
neno dream content, often in “dis- 
Dere oe However, it has been reported 
this effec Kahn, & Roffwarg, in press) that 
niekis te diminishes rapidly after a few 

ah cae laboratory, with normal subjects. 
cerned % of first-night dreams were con- 
Tectly, Reg the experimental situation di- 

eva i this fell to 9% for later nights. 
investigat studies have been carried out to 
team ¢ e the effects of various drugs on 
and oer for example, chloropromazine 
(W TAR (Lesse, 1959), meprobamate 
imipramine’ 1960), and 
3 mine 


i 
Stu 


tawn yet 
seem Present, then, posthypnotic suggestion 
method f We the most powerful and precise 
Its Use į or affecting dream content, although 
The rg restricted to a minority of subjects. 
mi of external stimuli presented to the 

Ng subject offers promise of strongly 


affect: 
cti. 3 
emai Ng dream content, but much research 


n . 
ERG ppl? be done in this area, using the 


nel ty M technique, to discover the opti- 
m, Pes Of stimuli and ways of presenting 


n 
riab ; 
les Affecting the Process of Dreaming 


Hn ig tte two powerful techniques which 
dre ps almost total disruption of the sleep- 
‘om eae The first is total sleep depriva- 
ip ch has received considerable atten- 


n 
Brangi ot years (Berger & Oswald, 1962b; 


tise West, 1959; Luby, Frohman, 
Grisen enzo, & Gottlieb, 1960; Luby, 
1962. y: 0hman, Lees, Cohen, & Gottlieb, 
: Gotti Y & Cardon, 1962; Rodin, Luby, 
ce ten 1962; Simon, 1961; West, Jans- 
Grang stet; Cornelisoon, 1962; Williams, 


Wanda, : 
Vitam, 026s, Lubin, & Armington, 19625 
964. w Hammack, Daly, Dement, & Lubin, 

iliams, Morris, & Lubin, 1962). 


The second is selective deprivation of certain 
aspects of sleep, Dement’s (1960) ‘“‘dream- 
deprivation” technique where subjects are 
awakened whenever a Stage 1 pattern appears, 
and Stage 4 deprivation (Agnew, Webb, & 
Williams, 1964) carried out in the same 
manner. 

Dement found that if, on several consecu- 
tive nights, a subject was awakened whenever 
Stage 1 dreaming began, the subject would 
have to be awakened more and more fre- 
quently to prevent Stage 1 dreaming as the 
nights went on. Furthermore, if the subject 
was allowed several undisturbed nights of 
sleep following this procedure, his Stage 1 
dream time was significantly increased for 
several nights. Control awakenings scattered 
randomly through Nonstage 1 sleep produced 
no such effect. Dement concluded that there 
was a need for a given amount of Stage 1 
dreaming each night, although whether this 
is a need for the experience of dreaming or the 
physiological state represented by Stage 1 
EEG is unknown. Agnew et al. (1964) fol- 
lowed the same procedure for Stage 4 of 
sleep, although the stimuli given their sub- 
jects whenever Stage 4 sleep started often 
served to shift them to a different stage of 
sleep without awakening. They found results 
similar to Dement’s, that is, more and more 
need to stimulate the subjects to get them out 
of Stage 4 as the deprivation nights went on, 
and an increase in Stage 4 sleep time on 

ts following this procedure. 


undisturbed nigh ing t 
As both the sleep-deprivation and dream- 


deprivation procedures represent a total dis- 
ruption of the sleep-dream cycle, rather than 
a selective effect on dreaming per se, they 
will not be further discussed here. 

The laboratory situation eo Besani 
ream process, just as 1 s 3 
ae On a subject’s first be > E 
laboratory he is likely to miss his firs a 
period of the night. This piama n 
been called the “first-night effect : E ; 
1955; Dement et al., in press; a hei 

& Verdone, 1964; Snyder, 19 J, a 4 
i + customary among dream researchers 
oo shale i t night count only as 


ve a subj ‘ 
n p night, rather than taking any 
precise measurements on it. 

Social isolation also affects the process of 
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Stage 1 dreaming. Wood (1962) found that a 
day of social isolation resulted in a 60% 
increase of Stage 1 dream time for the first 
two dream cycles of the experimental night, 
while the number of REMs/minute during 
this dreaming was decreased. P 

Gresham et al. (1963) studied the effect of 
alcohol and caffeine on Stage 1 dream time, 
and found that alcohol produced a significant 
decrease in Stage 1 dream time (a mean de- 
crease of 22 minutes for the first 5 hours of 
sleep), while caffeine, in the dosage used, had 
no effect. Rechtschaffen and Maron (1964) 
found that a mixture of dextro amphetamine 
sulfate and pentobarbitol sodium may halve 
Stage 1 dream time in many subjects. Dement 
(cited in Snyder, 1963) found that sodium 
amytal may also halve dream time. Heptabari- 
tone (Oswald, Berger, Jaramillo, Keddie, 
Olley, & Plunkett, 1963) also reduces Stage 1 
dream time. 

Tactual stimuli, in the form of the hardness 
of the surface on which the subject sleeps 
(a feather bed at one extreme and a board 
at the other) have been reported to affect 
sleep (Suckling, Koening, Hoffman, & Brooks, 
1957), but the data are not presented in a 
form which allows careful assessment of the 
effect on Stage 1 dreaming. 

Stoyva (1961) found that posthypnotic 
suggestions intended to influence Stage 1 
dream content also affected the duration of 
Stage 1 dreaming. In those of his subjects 
whose dream content was generally influenced 
by the suggestions in every dream of the 
night, Stoyva found a statistically significant 
decrease in Stage 1 dream time for the early 
part of the night, what he termed the “short- 
ening effect.” For the first two Stage 1 periods 
of the night, this effect w. 


5 as an average 7- 
minute decrease, 21% with a maximum de- 
crease of 14 minutes, 


Offering subjects money if they can dream 
more or less on different nights also has some 
effect (Rechtschaffen & Verdone, 1964), al- 
though the difference between the two condi- 
tions was only an average of 6.6 minutes per 
night (3% compared to base-line nights). 

A particularly interesting problem is that 
of exerting control over waking from sleep. 
Ladd (1892), speaking of whatever “mechan- 
ism” caused sleep, reported that “I ‘set? 


this mechanism so that it will dip down n 
sleep and dream-life, with a gradual cui 
ture, as it were, and then come out of mi 
life in an instant, i.e., by a steep curve. A 
provides no other information as to the nan 
of this sleep “mechanism” or how he Te 
about “setting” it, although one might su! A 
that he used some form of autosuggesti®" 
Nor does he report how successful this E 
nique was, although he mentions that it f 
more successful for arousing himself shor 
after going to sleep than for late in sena 4 
A very intriguing report on waking hi 
sleep is that of Frobenius (1927), Yia 
shows that subjects can wake from sleep * | 
randomly preselected target time on jth 
attempts. His subjects usually awoke W thet 
10 minutes of the selected time, and felt ra ne 
disoriented. They had no idea as to how |, 
had awakened, nor did the content 0 si 
dreams seem to reflect a waking proce” pe 
any way on those occasions when they 4”, 
from dreams, Elder (1941), in a brie’. 
stract, reports that he had confirme 


aun at 
least one case (Nelson, 1888). ae wi 
(who was his own subject) complai” es! 
his sleep was no longer refreshing 2° mpg 
of waking too frequently, so that he d om 
his interest in dreams in order tO orts o 
rest! Renneker (1952a, 1952b) rep° omj 
patient who habitually awoke a few 
before the alarm was to go off, supP 
order to prevent his dreams from 
anxiety filled, and presents some e of p 
analytic speculations about the natt” 3) v0 
ehavior. Omwake and Loranz ( : 1 
Brush (1930) also report data wi 


1964 
*J. Elder, personal communication, 1? 
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little doubt that some subjects can awaken at 
4 preset time, 
a Most recent study in which the process 
(1963). Ue was affected was that of Tart 
Working Using posthypnotic suggestion and 
found Senne with two subjects, it was 
awaken at the subjects could be made to 
their St at either the beginnings or ends of 
Of this S 1 dreams, although the accuracy 
ificulti ehavior was difficult to judge due to 
complian in defining criteria for “perfect” 
eniug’ nce with the suggestions. As in Fro- 
idea ag (1927) study, the subjects had no 
reams araw they had awakened, and their 
isci id not seem to reflect any processes 
mination or waking. Tart also found 
jects ca ea 1 dream time of one of the sub- 
of Dosth be increased about 20% by means 
t YPnotic suggestions to dream all night. 
Woul T eseat, then, posthypnotic suggestion 
‘or affe em to be the most powerful technique 
Selec vere the process of dreaming which is 
y to 7 its effect (although it is applicable 
a minority of subjects). Total sleep 
u É or dream deprivation is more 
a ut nonselective as it disrupts the 
a we are trying to manipulate. In 
the Psychological variables (sugges- 
ybnosis, laboratory setting, etc.) seem 
moyne MY a small effect on the actual 
°F time spent in Stage 1 dreaming. 


a 
gation during Sleep 
Westice and Emmons (1956) played recorded 
n oy mee answers to their subjects once 
he akeg S while they slept. The subjects 
td e to call out their own names if they 
ti Awe ee aston and answer. The subjects 
ks ang tially no responses to the ques- 
heq then Wers and no evidence of having 
St, Morning) (judged by questioning them in 
ne le whenever they showed Non- 
RI are €P patterns. The results of this 
wE l Datt ficult to interpret in terms of 
A arous erns, however, as they were mixed 


0 
yltaa Patterns, 
wie deprive Hammack (1961) trained 
a ad ona oa Subjects to press a micro- 


ho, xiong S certain schedule in order to avoid 
K, the “Ctttical shock. By avoiding the 
Subjects could sleep. They found 
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that their subjects could produce the required 
avoidance behavior in all stages of sleep, with- 
out showing signs of EEG arousal. Williams, 
Morlock, and Morlock (1963) extended this 
work and found that the subjects could dis- 
criminate between several tones, but that they 
tended not to respond during Stage 1 dream- 
ing unless the negative reinforcement was 
quite strong. 

Li, Jasper, and Henderson (1952) report 
that a narcoleptic patient of theirs could 
sleep through the noise made by the experi- 
menter hitting a brass pail right beside her 
bed, but would awaken immediately if her 
name was softly spoken. Oswald (1962) car- 
ried out two studies of subjects’ reactions to 
meaningful versus nonmeaningful stimuli dur- 
ing sleep. In one study, subjects showed more 
muscular responses to the sound of their own 
names and that of another designated name 
than to other, nonmeaningful names, although 
this response was usually accompanied by 
partial EEG arousal. In a second study, a 
series of first names was played repeatedly 
to subjects while they slept, but some of the 
names had been tape recorded backwards, 
making them meaningless sounds. The sub- 
jects showed more EEG responses without 
waking (K complexes) to the meaningful 
sounds. A similar finding for tones, one of 
which had been associated with a noxious 
stimulus, was reported in cats (Rowland, 
1957). Berger (1963), however, did not find 
any difference in frequency of ‘corpore g 
into the reported dream content of names o: 
persons with whom the subjects had emotional 
bonds versus emotionally neutral names. 

Thus the literature indicates that m 
«¢ and dreaming subject is not behaviorally 
ing a but may make, at least, some fairly 
inert, grecriminations and responses, some- 
simple acl by momentary awakening, 
earner within the sleep or dream state 


itself. 
D Discussion 


ttle doubt that, in light of the 
ies, the conclusions of 
i i ill be revised 

7 he earlier studies wi i 
ae eae of future research come In. And 
a Pate certain methodological problems to 
age even with the use of the EEG-REM 


There is little 
past decade’s discover 
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technique.* In spite of these drawbacks, some 
conclusions may be drawn from the studies 
herein reviewed. 

Sleep is not a state of behavioral inertness 
(a tradition long given implicit support by 
the extreme paucity of psychological studies 
of it). On the contrary, not only is a sleeping 
person having vivid experiences, dreams, 
periodically through the night, but (a) he is 
capable of incorporating some external stimuli 
into the content of his dreams, and (b) at 
least some subjects are capable of sometimes 
altering their state of consciousness from 
sleeping (or dreaming) to waking, either by 
means of a discriminative response to external 
stimuli or by using the occurrence of dream- 
ing itself or a given time of night as a 
stimulus. Furthermore, at least some sleep- 
ing subjects are able to carry out some rela- 
tively simple motor acts in response to 
external stimuli. 

At present then, we know only that some 
subjects can make responses during sleep, 
make discriminations, incorporate stimuli, etc. 
But it should be emphasized that the exact 
percentages of subjects, the exact behaviors 
observed, or the methodological difficulties of 
these studies are not the most important 
things to be noted in this review. Rather, 
the fact that any subjects can carry out these 
sorts of behaviors during sleep and dreaming, 
no matter how imperfectly at present, is the 
important thing, for it is highly probable that 
what has been done imperfectly by some sub- 
jects, using crude, exploratory methods, can 
be done far better by other subjects once 
these techniques are refined and developed. 
The immediate research need is not to show 
simply that some subjects can have their 
sleep and dreaming affected in these manners, 
but to investigate how well subjects can be 
trained to do these things, and what the maxi- 
mal level of performance and effect is, Then 


4In Tart’s (1963) study, where subjects were 
supposed to wake from either the beginnings or 
ends of their dreams in accordance with posthypnotic 
suggestions to that effect, difficulties arose in evalua- 
ting the results because of a number of responses in 
which the subjects seemed to be complying on an 
experiential level with the suggestions, but could 
not meet strict criteria for the beginnings and ends 
of their dreams in terms of the amount of time 
Stage 1 sleep had lasted. 


we shall possess a number of techniques ie 
actively manipulating dreaming, leading M 
results which can only be speculated a 
now (Tart, 1964a), but which will alm 
certainly surprise us. + fa 
The most immediate need in this area sti 
extensive studies of stimulus incorporati 
and of the potentialities for discriminative th, 
sponses to stimuli, including experientia 7 
criminations within the dream, wakinmi 
sleep at preset times, and overt moral 
sponses. How do various sensory M0 of 
differ in the way in which stimuli are ine 
rated into the dream? Are stimuli in 
modalities more distorted than m 9 
Can the degree of direct incorporation si 
distortion, be affected by various expe" ora 
techniques? Can subjects learn to aeon o 
some stimuli effectively enough 50 + gio 
could use these stimuli as “signals” to a t 
to the subject that he should catty out i 
specific action, for example, dream = at 
Particular topic, or carry out some MOP, jy! 
What sorts of motor acts can be carrie nf 
There are simple ones, such as raising F o 
to indicate that he is dreaming, of ee ww 
experiencing anger within the dream, bs 0 
about the possibility of more comple site 


f 


5 i) 
thet 
ib 


eae rT 
acts, say, automatic writing, or ger a 
Ultimately, to what extent could evel, 
way communication system” be [y 


: cy 
whereby the experimenter could instr! re 
subject to do such and such while he s ev, 
ing, and the subject could report on ving 
of the dream while they are OCCU% qye, 
such a development were possible, jee 
would lose their status as a purely ro i 
event that can only be reported in TC” or! 


i e 
and become a much more immediat i 
behavior. 


things about. Judging from the pres ay 
of the literature, posthypnotic pr pele 
seems to offer the most promise, pers 

this method nor any of the ee. 
really been adequately tested a” ran! “4 
What potentialities are there in Fi si” 

ditioning techniques, effects of inc" 
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wee combinations of drugs, etc.? 
Th out combinations of these? 
aa Es reviewed herein, then, do not 
Sirk = ia of knowledge about this area 
ted red serve a far more important 
be a p ee ne ie 
tiene thie fascinating possibilities for 
ch and development. 
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A GENERAL MODEL FOR THE STUDY OF 
DEVELOPMENTAL PROBLEMS? 


K. WARNER SCHAIE 


West Virginia University 


Various research designs employed in developmental 
investigation of maturational and aging effects are exami 
discrepancies and contradictions in the conclusions deriv 


psychology for the 
ned. It is found that 
ed from cross-sectional 


and longitudinal studies are consequences of the violation of assumptions 


implicit in these research designs. The conventional methods are show 
special cases of a general model for research 

The properties of the general model are expli 
the customary designs are reviewed in the light of t 
plete model requires consideration of the component: 
differences in the identification of developmental cha: 


and cross-sectional methods in this 
can rarely be met. 


It is the purpose of this paper to examine 
various research designs in developmental 
psychology with a view to resolving some of 
the contradictory findings reported in studies 
employing different designs for the investiga- 
tion of aging effects. In the course of this 
examination it is proposed to identify certain 
inherent inadequacies of the conventional 
longitudinal and cross-sectional appro: 
It will be shown that these methods are r 
special cases of a general model for re 
on behavior changes over time. The 


aches. 
merely 
search 


1 under- 
lying assumptions of this model will be ex- 
plicated and several new and more powerful 


research strategies will be 

Kessen (1960), 
of research design 
lems, suggested tha 
age functions is nai 
meaningful design 


developed. 

in his excellent discussion 
s for developmental prob- 
t the investigation of simple 
ive and proposed that most 
s involve either the inter- 
action of age and environmental changes or 
the analysis of age functions in special popu- 
lations. Only in Passing did he mention the 
even more typical case where aging interacts 


1 Parts of this paper were read at a s 
Research Design in Developmental 
the Los Angeles meetings of the American Psycho- 
logical Association, September 1964. Preparation of 
this paper was supported in part by a Public Health 
Service research grant No. HD 00367-02 from the 
National Institute of Child Health and Human De- 
velopment. 
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n to be 
on behavior change over time. 
cated and the assumptions for r 
hese properties. The com- 
s of age, time and cohort 
nge. Both the longitudinal 
strong assumptions which 
therefore suggested which 
and longitudinal methods 
ptions are presented both 
and the repeated measurement cases. 


latio” 
simultaneously with differences in POP outs 
and environment, Special cases may, A sensio? 
be derived where one or the other = usel 
is ignored. While such analyses may oa pl | 
and perhaps often present the only ei tH 
practical approaches, their limitation ™* ogo 
noted and their relation to the genera ia! | 
well understood.? pent” 
Kessen proposed that in any evelor a 
problem age-function relationships at nc 
tablished by cross-sectional study in pop! 
where it is possible to segregate a igat? i 
tions across the age range being ndie y 
He also suggested, however, that 5 chansi 
variables showing small but veliaht rates, g 
the analysis of sequences and 0 peha y 
change, and the follow-up of early = lt 
which are not found in the repertory qudi, 
organisms will usually require the = dist” | 
approach. It will be shown that suc case a 
tions do not apply to the general pas rs 
that their application to special cases pti 
been the result of misunderstood ass" oe 
What then are the attributes of ae 
developmental model from which b° an 
sectional and longitudinal methods 


AS 


in P 
106-107) discussed "id Ext 
“trapezoid” model which ptatiow D 
atical basis for our prese out * 
owever, has been worked 


2 Horst (1963, pp. 
the need for a 
vide a mathem 
such model, h 
date. 
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derived and from which their relationship can 
be clarified? A beginning may be made with 
Kessen’s formulation that “a characteristic is 
Said to be developmental if it can be related 
to age in an orderly or lawful way.” Proceed- 
ing further, Kessen’s most general definition 
of a Tesponse in a developmental model will 
© accepted. In this context he specified that 
: response may be seen as being a function of 
Ne Special population, and environment. Such 
deat ee will be made even more gen- 
whic owever, by recasting its content in terms 
ich will be found more fruitful for further 
analysis, 
sa Component of “age” will, of course, 
ioten the central ingredient, but it will be 
a o defined to mean the age of the 
response at the time of occurrence of the 
cisely o, 10 be measured. Even more pre- 
Units stare will refer to the number of time 
k lapsed between the birth (entrance into 
; “Nvironment) of the organism and the 


Point jn 43 
wn time i response is re- 
Cordeq at which the resp! 


Ris Component of “special population” will 
will ae by the term “cohort.” This term 
born ply the total population of organisms 
Rests Mhe same point or interval in time. 
tion Ctions as to the nature of the popula- 
in hae the latitude in defining the interval 
von designated as being common to a given 
Specia] will, of course, be introduced by the 
lem ș ;{8Sumptions appropriate to the prob- 
e investigated. 
€ term “environment” will be replaced 
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by the concept of time of measurement. In a 
most general sense the investigator is not 
really concerned with describing the nature of 
the total environment but rather with index- 
ing the total environmental impact as occur- 
ring at a given temporal point. Such an index 
of the state of the environment is clearly pro- 
vided by indicating the time of measurement 
(see Kuhlen, 1940). 

Our general developmental model will now 
hold that a response is a function of the age 
of the organism, the cohort to which the or- 
ganism belongs, and the time at which mea- 
surement occurs. It will shortly be seen that 
these three components are by no means inde- 
pendent. Their confounding, in fact, dictates 
the methods available for their separate and 
joint analysis. 

Consider next the temporal boundaries of 
the general developmental model. If the hu- 
man life-span is assumed to be bounded by a 
100-year limit, one can specify the total time 
required to measure all cohorts at all ages. 
The time required is simply twice the abso- 
lute life-span of the populations under study. 
Table 1 gives an example, where cohorts have 
20-year intervals and measurements are taken 
also at 20-year intervals, for a total develop- 
mental study of a population existing in 1960. 
It will be seen that this model permits, over 
its total 200-year span, several independent 
] studies but only one cross-sec- 
y which includes members of all 
r investigation. Note that only 
el can be studied unless retro- 


longitudina 
tional stud 
cohorts unde 
half the mod 


TABLE 1 


AGES oF 20-YEAR COHORTS ^ 


T 20-YEAR INTERVALS FoR Col 


AVAILABLE TO AN INV! 


ORTS 


ESTIGATOR IN 1960 o 


Age 

0 20 40 60 80 im a = = = = 

= 20 40 60 8 i m = = 7 

: w a 5 100 — = 

=s = 0o m 10 60 30 = 

= = == a a 4 6 50 10 
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spective measures are available, or unless a 
future study is planned to begin with the 
cohort born at the first time of measurement. 

Although the complete model can be studied 
with short-lived organisms, it will generally be 
most interesting to work with all the cohorts 
available at any given point of measurement. 
A general developmental model will therefore 
be described which will hold for that part of 
the complete life-span of the population which 
we wish to investigate. To give this model 
general applicability it is required that each 
age level be investigated for each cohort and 
at each time of measurement within the con- 
fines of the model. 


GENERAL DEVELOPMENTAL MODEL 


An example of a complete model is depicted 
in Table 2, which gives the ages of cohorts 
studied over the age range from 5 to 9 years, 
measured at annual intervals. In general, if 
the intervals at which a given characteristic 
is to be measured are called times of measure- 
ment (T), then 2T — 1 cohorts with interval 
latitude equal to the time interval will be 
required for a study of the complete model. 

Let us assume for the moment that the 
characteristics to be investigated may be found 
only at the times of measurement and for the 
cohorts (Cs) listed in Table 2. Let us also 
assume that scores for an independent ran- 
dom sample of each of the cohorts are avail- 
able at each of the measurement times. Let us 
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further assume that all groups have had equal 
amounts of practice on the characteristic a 
be measured. One of the striking features 0 
Table 2 is the fact that it is impossible t° 
measure all cohorts at all ages or to obtaim 
measures for all ages at each time of measure 
ment. The model can, however, be sectione 
horizontally (as indicated by the line ™ 
Table 2) to obtain a subset containing meas 
ures at all ages for all cohorts. Similari; 
by vertical sectioning a subset can be ei 
tained for which all ages are measured at 4 
times of measurement. This means that 48° 
is always confounded with either differences 
between cohorts or with differences betwee? 
times of measurements. In the general mode i 
it is possible to estimate age changes eo 
founded in either manner, depending wa 
the question we are interested in, or, aS will i 
shown, one can obtain estimates of net a 
differences whenever certain assumptions ar 
tenable. 5 
Examining Table 2, it can readily be se% 
that differences in mean scores between C° 
umns in the vertically sectioned subset W! 
represent differences in time of measureme?” 
that is, differences due to commonly expe! 
enced environmental variables or changes ? f 
cultural context. Likewise, the differences 4 
row means in the horizontally sectioned SU" 
set will represent differences between cohort’ 
that is, the effect of different life experienc? 
previous to the time of measurement or 8% 


TABLE 2 
AGEs OF COHORTS MEASURED AT Five AGES WITH ANNUAL MEASUREMENT INTERVALS 


Ages 


Time of 
measure- 
ment 


1960 


1962 1963 


1965 1966 1967 1968 


A 
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netic changes from one cohort to the next. 
For each subset it is possible also to compare 
means for the diagonals. Differences between 
means here involve age differences, which are 
controlled for time of measurement but con- 
founded with cohort differences in the vertical 
subset, and controlled for cohort differences 
but confounded with time of measurement 
differences in the horizontal subset. 

A clearer understanding of the implications 
of the model may be obtained by deriving the 
various methods therefrom for the study of 
developmental change. These include the con- 
ventional cross-sectional and longitudinal ap- 
proaches as well as a third, frequently over- 
looked but equally important, method which 
will be identified as the time-lag method. 


Relation of the General Model to 
onventional Designs 


Each of the first five columns in Table 2 
represents an example of the conventional 
Cross-sectional method. The hypothesis to be 
investigated simply asks whether there are 

ifferences in a given characteristic for sam- 

Ples drawn from different cohorts (i.e, born 
at different times) but measured at the same 
time. This is an important question, partic- 
ularly for the student of individual differences. 
fe e differences in behavior at a given point 1n 

me may require differential procedures re- 
sre of the antecedent conditions respon- 
ible for these differences. It must be recog- 
a however, that age differences in the 

SS-sectional design are inextricably con- 
Ounded with cohort differences and that it 15 
bees unreasonable to expect that the dif- 
will he between Cohorts ¿ and j at Time i 

etna e anything to do with the differen 

Ween the mean score of Cohort # at Time 

and its mean score at Time l. The fact that 

op empora] interval in years of birth between 
inter rts i and j is identical to the temporal 
is val between Times & and l for Cohort i 
ceva coincidental and irrelevant. It is con 
a le that the difference in means between 
p at 5- and 10-year-olds could be rep- 
for ing be of the difference between the men 
age 10 S-year-olds at age 5 and the mean at 

en if there were no shifts in the environ 

t and content of our culture. But mans 


miyi . . 
ronment does change, and projections 


over time based on cross-sectional studies are 
therefore fraught with great hazard.° 

Let us next look at the last five rows in 
Table 2. These rows represent the most gen- 
eral case of the longitudinal method, that is, 
independent random samples of the same co- 
hort drawn at successive times of measurement 
representing different ages.* The hypothesis 
to be investigated here asks whether there are 
differences in a given characteristic for sam- 
ples from the same cohort measured at dif- 
ferent times. This question is concerned with 
the effect of age differences within the same 
population and must be asked whenever one 
is concerned with predicting age differences in 
behavior over time. Longitudinal age differ- 
ences, however, are obviously confounded with 
time differences. In cross-sectional studies, 
great risks are taken if one attempts to gen- 
eralize age differences from one time of meas- 
urement to another. In the longitudinal design, 
it is for the same reasons equally risky to 
generalize age differences over time from one 
cohort to another. Personality theorists inter- 
ested in developmental problems have tradi- 
tionally been willing to take this risk, typi- 
cally in the special case where the same sam- 
ple is measured repeatedly. It will be shown, 
however, that such risk is a needless one which 
can be avoided by more efficient designs. 

A third method of developmental analysis 
is suggested also by the general model. It is 
not generally recognized but should be of 
particular interest to the social psychologist. 
This is the method of time-lag illustrated by 
the diagonals in Table 2. The hypothesis here 
is concerned with asking whether there are 
differences in a given characteristic for sam- 
ples of equal age but drawn from different co- 
horts measured at different times. This is a 
research design which should be used when- 


iti i hich 

i roblems pose additional issues wW 
a at at as soon as one proceeds beyond 
‘An extended discussion of sampling 
Sec- 


must bi 


ral case. 
attest and their management for the cross: 


tional method has been given elsewhere (Schaie, 
iona 


oY ald be noted that 


is si jal case involving 
i thod is simply a specia 
idin a measurement of the same sample from 
mena cohort. The advantages and liabilities of the 
F Ariel measurement case will be considered in a 
repea 


later section of this paper. 


the conventional longi- 
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ever the investigator is interested in the effect 
of cultural change upon the performance of 
individuals of similar age. Again, this effect 
cannot be measured directly due to its con- 
founding with cohort differences. 


Methods for Separating Sources of Developmental 
Change 


It will now be shown that the general model 
may be used to generate methods which make 
it possible, given certain assumptions, to 
separate the three sources of developmental 
change. Several definitions will be required 
for this purpose: 


1. Cohort difference (Cd) = the net effect 
of differences between a younger cohort, C; 
and an older cohort, C;, for random samples of 
subjects controlled for the effect of age and 
time of measurement; that is, net changes be- 
tween generations. 

2. Time difference (Td) = the net effect of 
differences between an earlier time of measure- 
ment, T;, and a later time of measurement, T; 
for random samples of subjects, controlled for 
the effect of age and cohorts; that is, net 
changes within the environment. 

3. Age difference (Ad) = the net effect of 
maturational change (over time) from a 
younger age, An, to an older age, An, between 
random samples of subjects controlled for the 
effect of cohorts and times of measurement; 
that is, net age changes within the organism. 


Next, let us define the three methods of meas- 
uring change over time, which we have deduced 
from the general model, as consisting of the 
following additive components: (a) Cross- 
sectional difference (CSd) = age differences 
plus cohort differences (CSd = Ad + Cd); 
(6) Longitudinal difference (LOd) = age differ- 
ences plus time differences (LOd = Ad + Td); 
(c) Time-lag difference (TLd) = time differ- 
ences plus cohort differences (TLd = Td + Cd). 
Transposing and changing signs, it can be 
shown that: 


Ad = CSd — Cd, 
Cd = TLd — Td, [2] 
Td = LOd — Ad. [3] 


It can also be shown by substitution that the 
following relationships will hold: Substituting 


[i] 
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Formula 3 into Formula 2, and Formula 2 into 
Formula 1 will yield 


Ad = CSd — [TL4 Adj}. [4 


Removing parentheses, collecting terms, and 
solving for age difference, it is then found that 


_ CSd — TLd + LOd [5] 
DO EEE, 


(LOd 


Ad 


Similarly, substituting Formula 1 into Formula 
3, and Formula 3 into Formula 2, we obtam 


Cd = TLd — [Lod — (csd — Cay}. [6] 


Again removing parentheses, collecting terms, 
and solving for cohort difference, it is fown 
that 


rca) 


ca = Tid Lod + cSd, 


Finally, substituting Formula 2 into Formula 
1, and Formula 1 into Formula 3, it may be 


found that 
Td = LOd — [CSd — (TLd — Td)]. [8] 


Solving Formula 8 for time difference, it is the? 
shown that 
0) 


ng a LOL ca TA. 


Unfortunately, it is not possible to comput? 
these net effects for the general model as 2 
whole, since equal representation cannot be 
obtained for all parameters. As indicate¢ 
above, one can either measure all cohorts at 
all possible measurement times with unequ@ 
representation of ages, or one can measure al 
cohorts at all ages with unequal representation 
of measurement dates; or lastly, all ages can 
be measured at all measurement times wit 
unequal representation of cohorts, ‘This re 
striction is not severe, however, since it is in 
most cases virtually impossible to obta!? 
successive independent random samples over 
the life span of all cohorts in a given populatio™ 

While it is impossible to separate the soure™ 
of variance associated with development? 
change for the total model, such unconfoundinë 
can readily be effected for any set of samp, 
for which independent estimates of each of a 
three developmental components are availa B 
With respect to age and cohort dilfere?® 
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o Som for any subset or subsets con- 
e pol of two cohorts; independent 
aro E from which are measured at 
wo geen also be accomplished wih 
with a a — differences for subsets 
random a e on ngi hinpanen 
ages being measured at two ti P g these 
unconfoundj i at two times. Moreover, 
time and ares can be effected with respect to 
ing of at le; hort differences for subsets consist- 
. ast two cohorts which are measured 
at least twice. 
A r s van subset which can be subjected 
which ts a ae ge is indicated in the group 
sities hs oy by a triangle in Table 2, 
specified as ues all the minimal combinations 
tion ERE i Depending on the generaliza- 
economical oon "i ill therefore usually Be most 
Cohorts ma ore to measure at least kiaia 
Cohorts at Te times or to measure many 
measuren east twice. Ifa third cohort or 
urther ment occasion 1S added, moreover, 
her checks on one’s assumptions become 
Possible, 
dee Samiri the above considerations 
Enni that it becomes possible to un- 
sooth, the sources of developmental change 
two or as one begins to analyze simultaneously 
more behavior sequences- In view of 


the : 
ioe advantage of obtaining uncon- 
ed measures of age changes; it follows 

ld be preferred 


that 

qt sequential strategies shou i 
Sectional onventional longitudinal and cross- 
Priate s methods in most instances. Appro- 
co a _ strategies will next be 
independ in detail; first, for the case Pa 
Case of os samples, and then for the specia 

peated measurements. 


Conort-SEQUENTIAL METHOD 


W 
henever the subset of samples tO be 
1 cohorts 


exami 
amin : 

ne contains measures for al 
it o ured at all ages, it may be concluded that 


i 
1S the į : Á 
€ investigator’s intention to generalize 


Ove 
o sts ort differences. Such a design samples 
ife span ree at many times within the 
referred t the cohorts. This design will be 
Since loi o aS the cohort-sequential metho 
Cohorts gitudinal sequences for two oF more 
Co Gilgen examined simultaneously. The 
age i method permits inferences 35 
anges at all points of the age range 


a and, also, inferences about cohort 
aed at all ages. The two bottom rows 
o Table 2 are an example of a typical cohort- 
sequential design. i 
The entries in Table 2 which have been 
labeled a, b, c, and d represent the minimum 
case for this type of analysis. The questions 
which can be answered in the illustrative 
minimum case are as follows: 


What is the net age change over one time unit 
(year) at age 5 (as sampled by Cohorts 1958 
and 1959)? 
Substituting the appropriate cell labels from 
Table 2 into Formula 5, it may be shown 
that g 


Ad= (C e (c—d) _ a—b+c—d 
> $ 


at is the net age change from 


Or, in general, wh 
ampled for cohorts i and 7? 


age m to age n as S: 


CiAm — CiAn t C Am — CjAn 
2 : ~= [10] 


Adcs = 


The estimate of the net age difference is ob- 
tained by averaging age differences for two 
successive cohorts measured over successive 


time intervals. 


net cohort difference on a given 


What is the 
ween Cohorts 1958 and 1959 (as 


variable bet 
sampled at ages > and 6)? 


Substituting into Formul: 
that 


a7, it will be shown 


(c—a)— (c—€) + (c—5) _ ctd—a—b 
Cd= 7 7 : 


hat is the net cohort difference 


Or, in general, w 
sampled at ages ” 


between Cohort: 
and 1? 


5 i and j; 


CjAm + Cin = CiAm — Cifkn 
Cdes = 1 2 7 [11] 


averages cohort differences at 


This estimate 
sured over two successive time 


ages mew 


two 

intervals. 

What is the interaction between age and cohort 
confounded 


(attributable to and 


differences a 
t differences)? 


with time of measuremen 
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Substituting into Formula 9, it is found that 


(c—d)— (c—b)+(c—a) _ btc—a—d 
2 2 


Td 


Or, in general, what is the average net change 
per time unit for Cohort i at age m and Cohort 
j at age n? 


— CiAn+ CiAm — CiAn — CA, 


Tdes 7 [12] 


This estimate averages cohort differences ir- 
respective of, and controlling for age level in 
order to obtain a time difference. It is 
necessarily confounded as it is impossible to 
measure two different cohorts of the same age 
at the same time of measurement. 


The analysis of variance may be used to 
provide a test of significance of the net differ- 
ences in the cohort-sequential design. Table 3 
illustrates the relationships described above in 
the analysis of variance context. It may be 
noted that increasing the number of successive 
times of measurement for all cohorts will 
extend the age range for which cohort differ- 
ences can be evaluated, while increasing the 
number of sequences would augment the 
number of cohorts over which age differences 
can be generalized. 

The cohort-sequential design is clearly the 
method of choice when a test of the generaliza- 
bility of constructs over different generations 
(cohorts) is required. It will be noted that this 
design controls for cohort differences which 
might be attributable to genetic changes at all 
ages and that the cohort differences are con- 
trolled for age differences. But it is not pos- 

sible in this design to identify an unconfounded 


TABLE 3 


ANALYSIS OF VARIANCE FOR THE 
Cowort-SEQUENTIAL MODEL 


K. WARNER SCHAIE 


time difference component. This means that the 
cohort-sequential method will yield unambiguous 
results only when the assumplion is met thal 
changes in the variable under stud y are unrelated 
to cultural change.’ This assumption is rarely 
met except in the study of physical attributes 
or in investigations involving infrahuman 
organisms. Even when this assumption is met, 
it is as difficult to obtain an ample coverage 0 
the life span of the organism in this variant 3S 
in the single-cohort or repeated measurement 
longitudinal design. Moreover, it will be see? 
that the cohort-sequential method will pio 
vide no more accurate estimates of age differ- 
ences than do the more efficient alternative 
procedures, 


Time-SEQUENTIAL METHOD 


The first alternative procedure to be ad- 
vocated is called the time-sequential metho 
and applies whenever the subset of samples 
under examination contains measures of & 
ages at all times of measurement, The time 
Sequential method permits inferences as to 28° 
differences at all points of the age rang? 
covered, as well as inferences about time diffe!” 
ences (or cultural shifts) at all ages, The first 
two columns of Table 2 give an example of # 
typical time-sequential design. 

The entries in Table 2 which are labeled & 
Í, & and h represent the minimum case for 
time-sequential analysis. The propositions 


which can be evaluated for the illustrative case 
are as follows: 


What is the net age change over one time unit 


(year) at age 5 (as sampled at Measurement 
Times 1960 and 1961)? 


Substituting the cell identifiers from Table 2 
into Formula 5, it is found that 


Degrees of 
Source of variation freedom 
Between cohorts (C) E 
Between ages (A) A— 
Cohort X age interaction (C—1)(A—1) 
Error N»— (C) (A) 
Total variation N-1 


a Number of observations. 


Ad= an (Fi (f=g) _ fre’. 


Or, in general, the net age change from age w 
to age nas sampled at Times # and Zis given by 


Ads TiAntTAn—T,An— TAn i 
2 

jal 

5 The conventional longitudinal method as a speci 


case of the cohort-sequential method is encumbe" 
by the same restrictive assumption. 


p13] 


& 
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The estimate of age differences is obtained by 
averaging age differences at two successive 
times of measurement for two successive cohort 
pairs. 


What is the net time-lag (cultural shift) for one 
time unit (year) at Time 1960 (as sampled at 
ages 5 and 6)? 


Substituting into Formula 9, it will be shown 
hat 


ta- U-99- 0+ fH) _ et mah 
2 2 


or in general, the net time-lag difference from 
me k to Time J as sampled at ages m and z is 


Tdys = DAt TAn Tiha Tihs eels 


This estimate averages the time lag differences 
two ages for two successive pairs of cohorts. 


What is the interaction between age and time- 
an differences (attributable to and confounded 
ith cohort differences)? 


Substituting into Formula 7, it is found that 


ca- Y-M- =o 9 _ HE, 
2 


Or in general, the average net difference be- 


twee £ 
n cohort at Time k for age m 
and T: s measured a 


ime / for age n is 


Cdeg= Tada Tidy Di Tide | [isd 


a estimate averages age differences, for 

ho e times of measurement to obtain a 

Bo rt difference. It is confounded, as it 15 im- 
Ssible to obtain members of the same cohort 
© retain the same age at two different times 
Measurement. 


ay: analysis of variance is again the 3r: 
i Priate technique for the evaluation o 
hay in the time-sequential design. oe 
time €s the analysis of variance design for the 
to the €quential model. In this design, adding 
Casi number of ages measured at each time 0 
wh: wtement will extend the age range for 
Nerea, time-lag differences can be evaluated. 
Woul sing the number of times of measurement 
Serve to augment the degrees of free- 


TABLE 4 


ANALYSIS OF VARIANCE FOR THE TIME- 
SEQUENTIAL MODEL 


Degrees of 
Source of variation freedom 
Between times (T) T-1 
Between ages (A) A-1 
Time X age interaction (T—1)(A—1) 
Error N—(T)(A) 
Total variation N-1 


dom available for evaluating overall time-lag 
differences. 

Two measurements of every cohort will 
suffice to permit the evaluation of time-lag at 
all ages of interest to the investigator. It 
would seem, therefore, that this method should 
be favored whenever generalizability of a con- 
struct over time is at issue. Note that the 
time-sequential model controls age differences 
for cultural shift and time-lag differences for 
the effect of age. It is impossible, however, to 
identify an unconfounded cohort difference 
component. This means that the lime-sequential 
method will yield unambiguous results only when 
the assumption is met that change in the variable 
under study is unrelated to genetic or cohort- 
specific environmental chan ges.® This assumption 
is a hazardous one only when physical attri- 
putes are involved or when behaviors are 
studied which are thought to be afiected by 
gross environmental crises such as wars or 
severe depressions. The assumption would 
seem to be quite tenable for many variables in 
the areas of ability, personality, learning, 
interests, and, in fact, in most matters of 
concern to the developmental psychologist. 

Jf neither of the assumptions underlying 
the time-sequential or cohort-sequential model 
can be met, then recourse must be taken to a 


third sequential model. 


Cross-SEQUENTIAL METHOD 


s-sequential method may be applied 

shenever & subset of samples contains meas- 

a of all cohorts at all times of measurement. 

The cross-sequential method permits inferences 

i i ime- tial 

sumptions given for the time-sequen 

H a rom to the conventional cross-sectional 

mo 


method. 


The cros: 
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as to time-lag differences at all points of the 
time range covered as well as to cohort differ- 
ences at all times of measurement. Any rec- 
tangular area of Table 2 may be subjected to 
cross-sequential analysis. 

The entries of Table 2 which are labeled å, j, 
k, and l represent the minimum subset for 
cross-sequential analysis. The propositions 
which can be evaluated for the illustrative case 
are as follows: 


What is the net cohort difference on a given 
variable between Cohorts 1956 and 1957 (as 
sampled at Measurement Times 1962 and 
1963) ? 

Substituting in Formula 7, it is found that 


ca= U2- ENHE eo f 


Or, in general, the net cohort difference be- 
tween Cohorts i and j sampled at Times & and 
Lis 

Cdxs= CTteT CDCA . [16] 


This estimate averages cohort differences at 
two times of measurement over two successive 
age slices. 


What is the net time-lag (cultural shift) for 
one time unit (year) at Time 1962 (as sampled 
for Cohorts 1956 and 1957)? 


Substituting into Formula 9, it is found that 


(i—k)— (Gf) + (i—) _ i 7-2-1 
2 2 ` 


Td= 


Or, in general, the net time difference between 
Times & and / sampled for Cohorts i and j is 


CT 7A CT.— CT— CT: 7 
5 As DA] 


Tdx S 


This estimate averages time differences for two 
cohorts over two successive age slices. 


What is the interaction between cohort and 
time-lag differences (attributable to and con- 
founded with age differences)? 


Substituting into Formula 5, it is found that 
j—j)— (i—)+(i—k) _ itl—-j—k 
Ad= a VTS as eE a 
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Or, in general, the average net age increment 
? t=) z: f=} D 

per age unit for Cohort i at Time k and for 

Cohort j at time / is 


Riess aa . [18] 


The analysis of variance design for the cross- 
sequential model is given in Table 5. Adding 
to the number of cohorts in this design will 
increase the range over which time differences 
can be generalized, while augmenting the num- 
ber of times of measurement will permit gene!” 
alization over time for the cohort differences. 
Adding both rows and columns will in addition 
increase the generalizability of the cohort-tim® 
interaction. 

The cross-sequential model controls tim® 
differences for changes over generations a? 
cohort differences for cultural shift. No W™ 
confounded age difference component is avat 
able in this model, and consequently it follows 
that lime-lag methods will yield unambignows 
resulls only when the assumption is met that 
change in the variable under sludy is unrelalel 
to age changes. This assumption may be tenable 
for many characteristics which remain stab’? 
over substantial portions of the organism $ 
life span. It is not usually supportable i 
developmental studies of childhood but M% 
frequently apply to investigations involving 
behavior change over time in adults, It shou! 
be noted that the cross-sequential model ca” 
be applied to any research design employing 
either the cohort or time-sequential method 
by simply adding one more sample as indicate 
in the upper triangle in Table 2. 


TABLE 5 


ANALYSIS OF VARIANCE FOR THE 
Cross-SEQUENTIAL MODEL 


Degrees of 


Source of variation freedom 


Between cohorts (C) C—i 
Between times (T) T-1 1) 
Cohort X time interaction (C=1)(T— 
Error N-(C)() 
Total variation N-1 


STUDY OF DEVELOPMENTAL PROBLEMS 


Empirical Checks for the Assumptions for the 
Sequential Models 


Numerous research situations will arise 
as there is no independent evidence as to 
A anette of the assumptions required 
tial i se inferences from the sequen- 
etwee; ategies. From the known relations 
chan; s ie components of developmental 
na Ar Sall now be possible to point to a 
tions i of empirical checks of the assump- 
conse t will also be possible to identify the 
tions quences of known violations of assump- 
confe since their result will not be random 
Peis but will lead to combinations of 

é ‘ng which may be of inherent interest. 
Quire c eck of assumptions will generally re- 
els, os analyses for two of the mod- 
Combin 1s requirement is unreasonable for a 
sequ n of the cohort-sequential and time- 
involve na Procedures but, as was pointed out, 
ion S only little extra effort for a combina- 

Of either with the cross-sequential strat- 
et us now examine each of the eight 
ore ee situations. a 
hange oe as unrelated to developmen al 
of 4... Bis situation may be detected by any 
both si strategies and will obtain when 
Significant effects and the interaction are non- 
is Jen wut: Since some developmental change 
there ae to operate in almost any variable, 
Sis, Eo anger in accepting the null hypothe- 

alysis where no differences are found in an 
tious in via two models, one should be cau- 
arge sa ng from such findings niles 
‘elation, as have been used. Such a lack of 
limite Ship will most likely be found only for 
eosa time, or cohort spans. 
KA) nse is a simple function of age: R= 
itom ș this situation is true, Ad estimates 
Will be © cohort- and time-sequential models 
Seq enti alent. The Cd term in the cohort- 
tial. in ‘al and the Td term in the time-sequen” 
term  telS will be zero. Both Cd and 
alternat the cross-sequential model will be 
qual € estimates of age change and will 
nife €ach other, No interaction can be S18- 


possib] 


Mean „M this instance. Such a finding would 
diffe ‘ at there are no genetic or cultural 
foung ces, a circumstance most likely to be 
VOlving © Well-controlled animal studies in- 


g Š 
© acquired behaviors. In human sub- 
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jects, this outcome implies that the behavior 
studied varies as a function of a stable 
maturational sequence. 

Response is a simple function of cohort 
differences: R = {(C). Here the Cd estimates 
from the cohort- and cross-sequential models 
will be equivalent. The Ad term in the cohort- 
sequential and the Td term in the cross- 
sequential procedures will be zero. Both Ad 
and Td terms in the time-sequential model 
will be alternate estimates of cohort differ- 
ences and will equal each other. Again, no in- 
teraction can be significant. These results 
would mean that there are no cultural or age 
changes but that response differences must be 
attributed solely to variation among genera- 
tions. This situation will arise rarely, except 
where true genetic changes in physical at- 
tributes may be observed. 

Response is a simple function of time dif- 
ferences: R= {(T). In this case the Td esti- 
mates from the time- and _cross-sequential 
procedures will be equivalent. The Ad term in 
the time-sequential and the Cd term in the 
cross-sequential model will be zero. Both Ad 
and Cd terms in the cohort-sequential model 
become alternate estimates of time differences 
which will equal each other. No interaction 
can be significant in this case. This model 
implies that there are no age or genetic dif- 
ferences and that response differences must be 
attributed to time or cultural changes. A situ- 
ation of this type will probably prevail for 
behaviors which ordinarily remain stable once 
they are acquired but which may be modified 
due to environmental changes. W ord pai 
attitude patterns, occupational choices, modes 
of dress, etc., may fit this model. an 

Response is @ function of age and cohor 

eee es: Rm JOC). T both age ana co- 
apen -o present, the Ad term from 
Hori gee tal ae will be significantly 
me agony Ad term from the cohort- 
larger than fi i t be so because 

tial model. This must - 
ae is the net Ad estimate, while the 
the Be ated by cohort differences. Simi- 


- is infl ; 
former the Cd term in the cross-sequential 


be willbe significantly larger than the Cd 
ier in the cohort-sequential model. The 


cohort-sequential Cd term is, of course, the 
proper Cd estimate in this case. The Td term 
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in the time-sequential model will here actually 
provide an alternate estimate of net cohort 
differences, and the Td term in the cross- 
sequential model is the alternate estimate of 
net age differences. The interaction term 
in the cohort-sequential model, if significant, 
implies differential age changes for different 
cohorts. The interactions for the other models 
are confounded and not interpretable. Most 
motor skills and perceptual variables which 
have maturational sequences and show some 
genetic shift, but which are unrelated to cul- 
tural change, will fit this paradigm. 

Response is a function of age and time dij- 
ferences: R={(A,T). If developmental 
change is due to both age and time differ- 
ences, then the Ad term from the cohort- 
sequential model must be significantly larger 
than the Ad term from the time-sequential 
model. The latter is the true Ad estimate, 
while the former is inflated by confounding 
with time differences, Furthermore, the Td 
term from the cross-sequential model will be 
significantly larger than the Td term from the 
time-sequential model. The former is inflated 
by confounding with age differences and the 
latter is the net Td estimate. The Cd estimate 
in the cohort-sequential model in this case 
provides an alternate estimate of time differ- 
ences and the Cd estimate in the cross-sequen- 
tial model estimates the net age differences. 
The interaction term in the time-sequential 
model, if significant, may be interpreted to 
mean differential age changes at different time 
intervals. The interactions for the other mod- 
els are again confounded and consequently 
not interpretable. Most acquired behaviors 
with maturational sequences will fit this 
paradigm provided they are unrelated to ge- 
netic changes. 

Response is a function of cohort and time 
differences: R=f(C,T). This paradigm is 
perhaps the most important one for the stu- 
dent of adult age changes, as maturational 
sequences have often been imputed by re- 
searchers where careful analysis would have 
shown cohort and time differences but no 
maturational change whatsoever. In this situ- 
ation the Cd term from the cross-sequential 
model will be the true Cd estimate, and it 
will be significantly smaller than the Cd 
term from the cohort-sequential model. The 
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latter represents the cohort differences Plt 
time differences. Also, the Td term from the 
cross-sequential model will be significantly 
smaller than the Td term from the timè 
sequential model. The former is the net +® 
estimate, while the latter again represents tH? 
cohort difference inflated by time difference 
The Ad term from the cohort-sequential mo 
yields an alternate estimate of time differ 
ences and the Ad term from the time-seque™” 
tial model represents net cohort differen 
The interaction term in the cross-sequent 
model, if significant, may be interpreted 5 
differential time differences for different el 
horts. The interactions for the other mode 
are again not interpretable. This para ip 
will fit many adult behaviors which pe 
through the individual’s life span once th 
are established at maturity, but which SB 
differences due to genetic and cultural facta 
Some of the mental abilities, many Pe e 
ality variables, and certain physical chari 
teristics which are modified by nutritional ay 
other environmental intervenors will prob% 
be found to show this pattern upon Pr? 
analysis. d 
. Response is a function of age, cohort on 
time differences: R = f(A,C,T). The last 
ternative will probably be found in the t° 
life-span study, or at least in studies inv? 
ing childhood and adolescence of the kit! 
variables which in adults will ft the 
{(C,T) paradigm. It will, of course, also ape 
to the limited number of variables W 
Maturational sequences which show both o 
tural and genetic changes. In this situatio 
the Ad term from the cohort-sequential m0 g 
will be inflated by time differences and H 
Ad term from the time-sequential model i 
be inflated by cohort differences, Similarly, ‘4 
the cross-sequential model both the Td ®, 
Cd terms will be inflated by age differe” d 
The Cd term in the cohort-sequential md 
and the Td term in the time-sequential me 
both represent cohort differences plus ge 
differences. The interaction in the coho’ 
quential model implies differential age Pyp 
for different cohorts, and the interacti®® oo! 
the time-sequential model represents diff? 55 
age changes at different times. The Ca 
Sequential interaction, if significant, ars 
represents differential cohort changes # 


tal 
Í 
Z 


- 
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Foes periods. It should be noted that 
a apis Ad + Cd and Ad + Td terms 
described represent what are conventionally 
as cross-sectional and longitudinal 

age changes. 
E uA the general developmental 
complete explained previously does not permit 
stances oo under these circum- 
the form ah additional assumptions about 
e inyi of the age gradients: are introduced. 
P who is interested in the 
ences eL of net time or cohort differ- 
Pail Te can always obtain such in- 
Tanges i by analyzing the age range or age 
change at which little or no maturational 
Such gd Practically all organisms show 
inital tiods for most variables of interest. 
Can be if two or more successive cohorts 
it then pang which show no genetic change, 
timate comes possible to get net Ad and Td 
are ie If two or more measurement times 
tural shift for which there is little or no cul- 
Ret Ad an. then it is also possible to obtain 

Table ea Cd estimates. 

ing of th Presents a summary of the mean- 
ach of ; various difference components for 
SPection z above-described paradigms. In- 
Bator with this table shows that the investi- 
i ™ be Out prior information on the para- 
be in fitting his data will be most likely to 
Use Information on net age differences by 
on © cross-sequential method. The best 
obtaining net cohort differences will 
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be found in the time-sequential method, and 
coverage for most contingencies in finding net 
time-lag differences will be obtained by using 
the cohort-sequential method. 


Decision Rules 


From the above paradigms, it now becomes 
possible to generate logical decision rules for 
the evaluation of empirical data which will 
permit inferences as to the proper meaning of 
any developmental changes found in such data. 
The reader must, however, be cautioned that 
use of the proposed decision rules is conditional 
upon the tenability of our assumptions about 
the slope of the developmental gradients. The 
scheme presented in Table 6 assumes that all 
differences are unidirectional and positive; that 
is, an incremental progression exists from a 
younger to an older age, from an earlier to a 
later measurement time, and from a younger to 
an older generation. The scheme will still 
work if all gradients are negative, but diffi- 
culties arise when one of the developmental 
components differs in slope. Appropriate modi- 
fication of the schema and the resulting decision 
rules will therefore be required whenever any 
of the components of developmental change 
departs from the assumptions of unidirection- 
ality and/or linearity.’ 

7 Nine basic types of slopes may be recognized. 
These are: zero, positive, negative, concave, convex, 
positive asymptotic, negative asymptotic, accelerat- 
ing, and decelerating. As a result there are as many 


as 729 different models when the slopes of the three 
components are jointly considered. In certain non- 


TABLE 6 , 
‘Ntexreeramion or DIFFERENCE COMPONENTS OBTAINED BY THE ARES oe VARIANCE FOR THE 
SEQUENTIAL MODELS UNDER THE DIFFERENT ASSUMPTION 
ial Model Time-Sequential Model 
Paradigm Cohort-Sequential Model Cross-Sequential Mo — a 
sme-lag CdXTd ime-lag igi 7 l 
R w ag ana oe ie interact. diff. diff. interact. 
S z aut. interact. p 
Reta) a a ee 
RO Ad = = Ad ee Ca Cd E 
IA, mo m L oS wa = cl Adc — 
ai ae j a 
AD Ad ca aaxca | AdtCd Âd ra — Td Ad TaXAd 
REED Ad-+Td Td = Ad aay Fouts CaTa Ci oa aan 
~ ei 4 
et Ad+Td caer AdxXCd Ad+Cd Ad+Td cdxTd 
as à Ids true also for the conventional longitu- 
“sotto Sp pet art = ial moet model applies to the conventional cross- 
or the ti =! 


Nal mep While the age-difference column 


Od. All differences are assumed t 


o be unidirectiona?- 
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Decision rules will now be given for data 
which are analyzed by both the cross-sequential 
and time-sequential methods and also for data 
which are analyzed by both cross-sequential 
and cohort-sequential methods. It is assumed 
that few investigators will have occasion to 
analyze a set of data by both cohort-sequential 
and time-sequential methods, but similar 
decision rules for this case can readily be 
derived from Table 6 if needed. 

Decision rules for the TS-XS analyses are as 
follows: 


1. If Tds ~ 0; then R = f(A); if not, and 

2. if Td,, ~ 0; then R = f(C); if not, and 

3. if Cda œ 0; then R = f(T); if not, and 

4. if Tda + Tda — Adn ~ 0; then R 
= f(A, C); if not, and 

5. if Tda + Cda — Tda 0; then R 


= f(A, T); if not, and 
6. if Td + Adu. = Tdi ~= 0; then R 
= £(C, T); if not, then R = f(A, C, T). 


Decision rules for the CS-XS analyses are as 
follows: 


1. If Cde œ 0; then R = f(A); if not, and 

2. if Ade œ 0; then R = f(C); if not, and 

3. if Cda 0; then R = f(T); if not, and 

4. if Cda + Tda — Cde 0; then R 
= f(A, C); if not, and 

5. if Cda + Cde — Ades 0; then, R 


= Í(A, T); if not, and 
6. if Ad + Cd,, — Cd.,™~0; then R 
= {(C, T); if not, then R = f(A, C, T). 


Empirical estimates will, of course, be sub- 
ject to sampling fluctuations and the definition 
for zero will have to be subject to the level of 
confidence specified for a given experiment.’ 


REPEATED MEASUREMENT DESIGNS 


Whenever experimental error is large or 
where differences between experimental condi- 
tions, though significant and stable, are ex- 


linear designs, moreover, confounding may not be 
complete. It is unlikely that there are empirical data 
which fit each of the possible models. Also, empiri- 
cal data might equally well fit several of these 
models. A more detailed analysis of this problem 
goes beyond the scope of this paper and will be 
attempted elsewhere, 

8 The decisions will be based on a comparison of 
the mean squares obtained from the analyses of vari- 
ance as detailed in Tables 3-5 and 7-9. 
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pected to be small, variability may be reduce 
markedly by use of the matched group desis 
The conventional longitudinal method is © 
sentially an application of such a desig" 
= : ; : hed gro 
will now be shown how the match alt 
design can be applied to the sequential aa 
gies and models discussed above. Before wie 
into such detail, it will be useful to aon 
some of the general problems of ma 
group designs in developmental stūdies a it 
In a repeated measurement design t” 
vestigator obtains measures on the same - 
ple of subjects at different times. Such "i, 
tition poses no problems where physic is 
tributes are involved, but complication’ 
whenever variables are studied wer p 
amenable to practice or where the testing ad 
cedure affects the behavior to be merit 
Instead of comparing independently here! 
groups of subjects and being limited u 
to statements of the difference betwee” P 
means, the repeated measurement desis” yp 
mits comparing each observation of ê sw 
ject’s behavior with some earlier 108 
measurements. In theory, therefore, it j n 
be possible to detect small but reliable c t 
because the matching procedure will Te" yn 
more sensitive estimates of such change 
less adequate controls are provided, ane of 
are for practical and understandable A tH 
usually lacking in developmental studiè? y 
small but significant changes may simP” 
a function of practice or other artifacl™ yi 
It is not difficult to draw new rai 
samples from a carefully defined pop" ‘it i 
over long periods of time. In contrasts alt 
virtually impossible to maintain a single ^p 
ple in its entirety over any length of a 
Unfortunately, sampling attrition is not wl 
dom but is often related to the very val p 


ments are available is substantially S yé 
the original sample, it is, of course, "° i 
sible to generalize even for the single je 
sampled. No such evidence is readily 2% ul 
for most of the various longitudinal it 
reported in the literature, tO, 

Attention should further be called aty 
fact that for an equal number of obser el 
the degrees of freedom available for t a” 
term used to test mean differences W! 
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oe for the replicated than for independ- 
tivi d erived measurements. The gain in sensi- 
y must, therefore, exceed the loss caused 
miare ane a smaller number of individuals 
pay a repeated measurement design will 
always h S repeated measurement design will 
design i e inferior to the independent sample 
such diffe pengen yi je differences, since 
compari erences must of necessity involve the 
Tabler of independent samples. 

iena 4 and 8 give the analysis of variance 
models or the cohort- and cross-sequential 
one repeated measurements. A time- 
ane model is not available for the 
cas measurement case, since repeated 
first soa far would not be available for the 
Set. (se ast cohort represented in each sub- 
general Table 2 and the discussion of the 
Nifican model), It will be noted that the sig- 
and rn of cohort differences in both cohort- 
an erro Ss-sequential models will be tested by 
etwee : term based on the residual variation 
tial mod Sab eate which in the cohort-sequen- 
ewen adel will have N — (N/A) — C (A = D) 
aA of freedom than in an independ- 

of obse ple design based on the same number 
e used for testing the Si the error term t 
differen for testing the significance of cohort 
ave x in the cross-sequential model will 
— (N/T) —C (T—1) fewer degrees 


Anay TABLE 7 
i YSIS ` PEN 
1S OF VARIANCE FOR THE COHORT-SEQUEN 


M r x 
Se on WITH REPEATED MEASUREMENTS 


TIAL 


Degrees of 


Source of variation T 
Bety, : 
tween subjects K 
A 
Bety 
etween cohorts teal 
Resi i 
Stdual between subjects re 
Vithi 
thin subjects ri 
B jal 
etween ages a 
Cih l 
t X age interaction (Eene i 


2 
Residual] w; 
ual within subjects 
Y 
Otal vari 


y 
E a E E E 
A A) 

v-1 


ation 


TABLE 8 


Degrees of 


Source of variation freedom 
á N 
Between subjects 7 
Between cohorts c-1 
N 
Residual between subjects Te 
Within subjects N -7 
Between times T-1 
Cohort X time interaction (C— 1)(T—1) 
N 
Residual within subjects N- (=) —C(T-1) 
V-1 


Total variation 


of freedom in the repeated measurement than 
in the independent sampling design. 

The age difference and age-by-cohort- 
interaction terms in the cohort-sequential 
repeated measurement design will be con- 
founded both with time differences and prac- 
tice effects. The error term available to test 
these differences is based on the residual vari- 
aiice within subjects. This error term will have 
N- (N/A) fewer degrees of freedom than 
the comparable error term in an independent 
sample design with the same number of ob- 
servations. Similarly, in the cross-sequential 
repeated measurement design, the time-differ- 
ence and cohort-time interactions will be 
confounded with age differences and practice 
effects. The error term available to test these 
differences will have V — (N/T) fewer de- 
grees of freedom than the comparable inde- 
pendent sample design. f ; 

It follows from the above discussion that 
the repeated measurement designs wL i 
come efficient only where measures en sie 
reliability oT where measures are tal en a 

any ages or many times. F urthermore, it is 
ne sible to separate age differences from 


imposs = 
ae effects in the cohort-sequential model, 

rac P G i Ry 
sa it is impossible to separate time differ 
a s 


ces from practice effects in the cross-se- 
qquential model. The precise meaning of the 
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confounded differences under various assump- 
tions may be obtained from Table 6 by adding 
practice effect to each of the entries in the age 
difference column for the cohort-sequential 
and in the time-lag difference column for the 
cross-sequential model. These relationships in 
the case of the repeated measurement designs 
will hold only if independent sets of samples 
are used for the cohort- and cross-sequential 
analyses. Overlapping will, furthermore, add 
the practice effect to all Td and all Cd terms 
in the cohort difference column for the cohort- 
sequential model.” 


MEASURING THE EFFECT OF PRACTICE 


A direct estimate of the effect of practice as 
a component of developmental change may be 
obtained from the repeated measurement de- 
signs if, and only if, age is not significantly 
involved in the cohort-sequential design, or if 
time is not significantly involved in the cross- 
sequential design. The effect of practice is 
confounded with the other components of 
change whenever a more complex set of as- 
sumptions is required. This is unfortunate 
since it is often important to identify the 
extent to which a behavior has been altered 
by the measuring operation or to insure that 
a gain in scores cannot be explained simply 
by the increased familiarity of the subjects 
with the measuring instrument. An independ- 
ent estimate of practice effects may be ob- 
tained under all assumptions, however, if 
the investigator is willing to employ conjointly 
repeated measurement and independent ran- 
dom sampling designs. 

The procedure to be advocated requires 
that independent samples be drawn which are 
to be compared with the second or subsequent 
measurements for the repeated measurement 
group. The investigator must obtain a random 
sample from the same cohort which differs 
from the latter group only in the character- 
istic that it has been measured one time less 
than the repeated measurement group. The 
analysis of variance design for the study of 
practice effects is shown in Table 9. 

It will be noticed that the proposed design 


9 Bell (1953) presents an overlapped design which 
confounds the information separated in this paper 
by use of the repeated measurement design for the 
cohort- and cross-sequential models. 
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TABLE 9 


j 3 NAL 
ANALYSIS OF VARIANCE FOR THE Cross-SECTION 
STUDY OF PRACTICE EFFECTS 


Degrees of 
Source of variation freedom 
Between ages + cohorts (AC) Ac-1 
Levels of practice (P) peat: p-)) 
Age + cohort X practice interaction (ac—1)( P) 
Error N—(AC)( 
Total variation N-1 


has the usual cross-sectional approach ; i 
ciency in that age and cohort differences T 
confounded and in that the interaction 
therefore uninterpretable unless it is "° of 
from independent sources that the effect 
either age or cohort differences is Zero: “| io 
more complex design will be requiret ij 
remedy this difficulty and is feasible on 
an opportunity for at least three meas! C 
ments exists. In this case, it will also be of 
essary to obtain a repeated measureme™ 4 
the random sample first tested on the S¢° ie 
occasion. Furthermore, a new random $4”, 

must be drawn on the third occasion wd 

differs from both the first and second repo" y 

measurement samples only on the chara? at 

istic of having been measured once less * jt 

the other groups. Groups may then be Sar 

in random halves and appropriate ass de 
ment to the different levels of practice Phy 
The analyses for such a design will be si” 
to the ones given in Tables 3, 4, and 5 & ich 
for the addition of a main effect for pra¢ e 
and the simple and triple interaction bet™ y 
practice and the other components of desd, ; 
mental change. Such a design is quite ething | 
in the case of the time- and cross-seque™tys | 
methods. It faces the usual sampling att? f 
problems in the cohort-sequential appr 

which for human subjects is, in any aie | 
best limited to short-term studies. | 


w 


CONCLUSIONS 


The results of our analysis suggest (oe, 
the conventional longitudinal and FUA 
sectional methods for the study of dev pe | 
mental problems can lead to compie o 
results only when there are no ge? na 
cultural changes in response to the K 
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Studied. Such circumstances, however, are rare 
among the areas of interest to the develop- 
ii psychologist. Research strategies are 
ire therefore, which will permit the 
se pene of the components of develop- 
a del change Consideration of a general 
sequenti as permitted the development of 
Canton ial strategies which permit such un- 
iow nding, and it has furthermore been 
Sin ‘9 that the conventional methods are 
Se Ply incomplete special cases of these 
quential models. The hazards of the re- 
Sa measurement designs have been ex- 
fie , and alternate methods have been 
gested for the study of practice effects. 
oe now be argued that the cohort- 
oide method (and the conventional 
is - inal method as a special case thereof) 
ie inefficient. Study of, Table 6, 
infor Ver, will show that the most interesting 
ay Mation available from longitudinal study 
equally well be obtained by a combination 
the time- and cross-sequential approaches. 
k suggested, therefore, that the most 
con lent design for a developmental study 
Sists of the following scheme: 


eff 


wa ae a random sample from each cohort 

Wig os ee ae be investigated, and 
Sure at Time $ (Score A). 

ject Get a second measurement on all sub- 
s tested at Time k at Time J (Score B). 

hataw a new random sample from each 

coh; in the range tested at Time k plus one 
ort below that range, and test at Time l 
Core CS), Se, 


The 
Made. lowing comparisons may now be 


and C) (a) Cross-sequential model (Scores i 
and ©) (b) Time-sequential model (Scores, A 
“epeat ), (c) Cross-sequential model — 
(d) ha Measurements (Scores A and } 
fec We llc model, controlling for €t- 
Alm Practice (Scores B and C). 
evelon all questions of interest to the 
is = mental researcher can be handled by 
able Ra EY which should yield interpret- 
tions 10 ults in practically all empirical site 
rc -° Sampling attrition in the repeate' 
2 the Proposed research strategy has been applied 


ToBniti study of developmental changes in adult 
1964p) ° behavior (Schaie & Strother, 19642; 
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measurement part of the study should be 
fairly limited over a single time interval. 
Also, if desired, the new random sample 
obtained at Time / will provide the base for 
a second repeated measurement study, and 
any cohort or cohort sequence can, of course, 
be followed further if a conventional longi- 
tudinal study is to be pursued. 

It is hoped that this paper has clarified 
the relations between the different types of 
basic experimental designs in developmental 
psychology. A next logical step should be 
in the direction of reexamining some of the 
studies in the developmental and attitude 
survey literature with the hope that puzzling 
inconsistencies can now be resolved or 
identifed as artifacts of improperly under- 
stood experimental designs.** 


11Examples of studies containing data which 
could readily be examined via the proposed models 
may be found in Shuttleworth (1939) and in Hyman 
and Sheatsley (1964). 


— 
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A MODIFICATION OF THURSTONE’S LAW OF COMPARATIVE 
JUDGMENT TO ACCOMMODATE A JUDGMENT 


CATEGORY OF 


“EQUAL” OR “NO 


DIFFERENCE”! 


MARSHALL G. GREENBERG? 


University of 


Minnesota 


A model is proposed which modifies the Thurstone law of comparative judgment 
extending it to the treatment of pair comparison data in which judges are permitted 


a judgment category of “equal’’ or “no difference.” 
tion of the perceptual process the model introduces 


Based upon a rational descrip- 
a threshold parameter and ap- 


portions the “equal” judgments between the 2 stimuli to estimate the actual pro- 
portions perceiving one greater than the other on some underlying attribute, This 
apportionment is carried out within the spirit of the Thurstone model and is shown 


to be closely approximated by splitting the “no difference” 
tween the 2 stimuli being compared. Problems of estimation 


tions are discussed as well. 


In 1927 a pair of classic papers published by 
Thurstone (1927a, 1927b) introduced his law 
of comparative judgment thereby giving birth 
to a new era in psychological scaling. This 
model provided a means for analyzing not only 
comparisons of physical intensities, but quali- 
tative judgments as well, and led to applica- 
tions in the measurement of attitudes and other 
subjective attributes. In brief, the law of com- 
parative judgment provides a means for trans- 
forming the pair comparison data obtained 
from the k(k — 1)/2 pairs generated by k 
stimuli into a set of k scale values for these 
stimuli along some underlying psychological or 
psychophysical dimension. Thurstone speci- 
fied (1927a) that the method “applies funda- 
mentally to the judgments of a single observer 
who compares a series of stimuli by the method 
of paired comparison when no ‘equal’ judg- 
ments are allowed [p. 276].” He then pro- 
ceeded to modify the model to permit replica- 
tions of judgments across as well as within 
individuals. The present paper introduces a 
modification which permits judgments of 
“equal” or “no difference” while retaining the 
spirit of Thurstone’s approach. 


TuuRSTONE’s Move. 
No attempt will be made to describe Thur- 
stone’s law of comparative judgment in great 
! This work was supported in part by Grant GB-2345 
from the National Science Foundation. 
2? Now at The Procter & Gamble Company, Cincin- 
nati, Ohio. 
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responses equally be- 
and alternative solu- 


detail since it has been presented and et 
marized in many places elsewhere (Coon 
1964; Guilford, 1954: Thurstone, 1927a, 1927 a 
Torgerson, 1958). The present discussion a 
ply attempts to capture the flavor of the gin 2 
while laying the groundwork for the modifi 
tion to follow. POR”: 
The model assumes that when a pait ° 
stimuli is being judged with respect to sor 
attribute, each stimulus may be representi 
by a distribution over the continuum a5 E 
Figure 1. The use of distributions rather ther 
point representations reflects the notion th 
there exist momentary fluctuations in the 1 d 
vidual’s perceptions of the stimuli. Designati”? 
the entire set of stimuli by S = GS Say: i 
Si, +++, Sa} the distribution for any stimul 
Si is assumed to be normal with mean Si @™ 
variance g. When an individual is present? 
with two stimuli and is asked to judge whit 
has more of some attribute it is assumed th® 
he draws a random sample from each of oe 
two stimulus distributions and chooses p 
stimulus whose sample observation is e 
greater. Since the decision is dependent up? 
the difference between sample observation 
the model may be formulated in terms of jp 
distribution of sample differences as show” 


Figure 2. , and 
Given the individual distributions for <!“ pe 


Sj, the difference distribution is known t° 
normally distributed with mean 5; — ie p 
variance o? + o? — 2rioi;. The shaded * of 
in Figure 2 corresponds to the proporti? 


Si 


MODIFICATION OF THURSTONE’S LAW 


Si Sj 


Fic, 1. Representations of two stimuli 
along an attribute. 


nt, Š perceived as greater than S; and 
tion of ace proportion is the propor- 

sing em a u is judged to be greater than Si. 
tions oS data to estimate these propor- 
Values § ne to determine the scale 
Solving a N Si 077, Sk of the k stimuli by 
equations hie of simultaneous observational 

the form: 


q ze 

wh ae an= zyl? + Ce 2r 0103)", C1] 
to the — me normal deviate corresponding 
than g; aaa of judgments of S; greater 
knowns q PECRUSE there are many more un- 
Caseg aa observational equations, special 
ermine q T be established in order to de- 
hurst ee sho. r PERI 

tenilo e s Case V is the one which is most 
of sim employed and makes the following 

mplifying assumptions: 
L. Tnte 


freq 


Di: aindividual replications can be re- 
ced by interindividual replications. 


a 


of = a Spee oo = 


je yas a 


Nus E i 
Equation 1 becomes 
S;- S; = sj0[2(1 — ry]. 


B 
Y lett; 
Measure aa o[2(1 — A)” 
in nent and setting it equal to one, 


[2] 


be the unit of 
ta we 


8 — 8) = ay 


[3] 


as y 
Ne ol a 
Me W n a equation. 
are g escrih, for solving a set of such equations 
ps A bed by Thurstone (1927a), Guilford 
logy 20d Torgerson (1958). Mosteller 
tho” & Ra shown that these procedures 
ji Scale ast-squares solution for estimates of 
Sa Values. Mosteller also discusses the 
r actually assumed that all the inter- 
eers at zero. Mosteller (1951) pointed out 
ed on) UMPtion was never used and that in fact 
“ual, Y be assumed that all the intercorrelations 
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o 55 


Fic. 2. Distribution of sample differences 
for S; and Sj. 


solution for cases in which the standard devia- 
tions are unequal and presents a chi-square 
goodness-of-fit test for the case where the 


sigmas are equal. 


A MODIFICATION OF THURSTONE’S MODEL 


Thurstone’s model assumes that whenever 
the sample observation drawn from S; is 
greater than that drawn from S; the individual 
chooses S; as having more of the relevant 
attribute. Since the distributions involved are 
continuous the probability of equal sample ob- 
ations is zero, and a binary response will 
ir. The proposed modification intro- 
ameter k;; analogous to a threshold 
he smallest difference be- 
tween sample observations that can be de- 
tected by the individual. When the sample 
observations differ by an amount greater than 

ting the larger sample 


kij the stimulus genera 
observation is chosen. When they differ by 


less than kij a judgment of equality is offered. 
Letting Si represent a sample observation 


from Si, 


serv 
always occu 
duces a par 
value representing t 


[4] 
[5] 


s= Sj > ki? S; chosen, 


SPS kij= S; chosen, 
— sj) £ ku> 
S; and S; judged equal. [6] 
ts the distribution 
The present model segmen 
of Jee as shown in Figure 3. The shaded 


area corresponds to the proportion of judg: 
i eats of “equal,” “game,” or “no difference, 
m a 


— kij < (si 


-kjO +kj Sj-5i 
n of sample differences for Si 


istributio 
6, 3: Dis hreshold parameter hii. 


and S; with t 
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while the other two sections represent the pro- 
portion of responses of S;> S; and Sy > Si. 

At this point we may further introduce the 
assumption that k;; is a normally distributed 
variable with mean k;; and variance o°r;; = 07» 
for all i and j. Then if the values of k are 
assumed to be independent of the S;— S; 
joint distributions the effect is simply to 
increment the variance of the difference 
distributions by o”,. This will alter the unit of 
measurement but leave the solution otherwise 
unchanged. 

Once again we need to solve equations of the 
form = 

8:- S;= Zij [7] 
where 2’;; is the standardized distance between 
(5: — 5;) and 0. The relevant normal deviate, 
however, no longer corresponds to one of the 
observed proportions, but is midway between 
their normal transformations. 

Let pi; and pj: be the proportions of judg- 
ments S; > S; and S; > S; respectively. We 
lose no generality by assuming pj; < pji. 
Then 


S; -—S- kij = Sj [8] 


[9] 
where zi; and zj; are the normal deviates corre- 
sponding to p;; and p;;, respectively. Summing 
Equations 8 and 9 yields 


Š; as Š: = 1/2 (zji = Zij) = 2 53. 


and = _ N 
Bi — Sit kij = — Zij 


[10] 


The z’ values obtained from Equation 10 may 
now be incorporated into the usual solutions 
for the law of comparative judgment model to 
recover the set of scale values 81, 8», ++- 
Dg hy Bis 


? 


Discussion or THE MODEL 4 


The proposed modification of the Thurstone 
model contributes to its application in essen- 
tially two ways: (a) It permits the collection 
of pair comparison data in which individuals 


4 After this paper had been submitted for publication 
the author discovered that Glenn and David (1960) 
had independently conceived and developed the same 
modification of the Thurstone model, although their 
analytic procedure differed from that proposed here. 
This portion of the present paper has undergone con- 
siderable revision as a result of the ideas presented in 


their paper. 
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have been offered the option of judging stimuli 
to be equal. (6) It apportions judgments ° 
equality to the two stimuli in a pair comparison 
in a manner consistent with the spirit ° 
Thurstone’s model. In fact the original mo F 
becomes the special case of the modified Ve” 
sion in which k;; = 0 for all i and j. 4 

A difficulty inherent in the usual solution’ 
to the standard Thurstone model arises } 
connection with this modified version as v 
There are two sets of parameters to be i 
mated in the present model, a set of ij Ve 
and a set of interstimulus distances © 
form 8; — 5,. The interstimulus distances 9 
estimated from Equation 10. The expres is 
for estimating the discrimination threshol 8 
obtained by subtracting Equation 9 from 
which yields 


- 11] 
kij E 1/2 (zi; + Bi). [ 
e of thes? 


Denoting the asymptotic covariance ue 
om Ed 


estimates by Yk;;,5; — sp it follows fr 
tions 10 and 11 that 


Veen 8; — si = 1/4 (87.,, — Saa) 


ye Pil— pa) pil pa | g% 
=1 ew , 
n| NSP ey Ny 256 


t 
where s°.,, is the sampling variance of zij 3° 4 j 
mated from the sample proportion, fij Yih 
the ordinate of the normal curve at the P? g 
of truncation z;;, and N is the total num ind 
judgments. To the extent that the sa™P ce 
errors of z;; and zj; differ there is a cova! tel? 
between the estimates of the two param? tbe 
involved. Thus, a least-squares solution t° „yo 
problem as it stands is quite difficult, The”? jiy 
at least two ways of resolving the difi” 
presented by this covariance. 

The approach taken in the present p ih 
suggesting the use of Equations 10 and a 


plications in which the model would P® gÝ 
ployed, the proportion of no-decisio™ Yj" 
will be low, yielding small values for t2® "jt 
Consequently differences in the $2, 
errors of zi; and zj; will be minimal. 4 
proach essentially parallels that of the d w 
ard Thurstone procedure as describ? gt% 
Mosteller (1951). Mosteller’s least saion tg 
lution also fails to take into considera" sgu 
ferences in the variabilities of the zi; V4 
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to estimate scale separations. A true least- 
squares solution would assign weights to the 
estimates in proportion to their sampling 
aa Mosteller pointed out that such 
An da would be “unmercifully extensive.” 
Sorel ies justification for ignoring the 
tees estimates is the fact that the primary 
‘hile i ies in estimating the scale separations 
ie estimation of „the ki; values is 
ap F a to this more basic problem. One can 
bes i quation 10, in which the value of kij 
and ropped out, to obtain an estimate of s'i 
anal: 5p on to employ the usual Thurstone 
alysis without ever considering Equation ‘at. 
Ti ba approach is that taken by Glenn 
lem Pn (1960). In order to avoid the prob- 
the n correlated estimates they approximated 
ime difference distribution by a sine 
formatio and employed an inverse sine trans- 
lenn ion to scale the response proportions. 
Ples = ge David showed that for large sam- 
distan € covariance between estimates of scale 
aptoni and the discrimination thresholds 
with oe zero in such a model. Still faced 
estim. ightly nonhomogeneous variances among 
cin of scale separations and the dis- 
ot Zon thresholds, the authors presented 
P and unweighted solutions. Their 
présen, ted solution is essentially the one 
invers w in this paper, but employing an 
Norm, i sine transformation rather than a 
ions ee on the sample propor- 
of the heir weighted solution uses the results 
Rte eee analysis as the first step of i 
Would he xtensive iterative procedure whic 
Electro e impractical without the aid of an 
ie nic computer. Glenn and David sup- 
tained. numerical example in which they ob- 
amon values for secretaries’ preferences 
Equa ive brands of carbon paper. wi 
to t Siu 10 of the present model was applie i 
the Pa data and the z';; values analyzed i 3 
g Scale ard Thurstone procedure, the = b 
oment values produced Pearson pro uct 
Weighte Ane maior of .999 with both their 
he sh unweighted solutions. - 
®Dplicatio, ity of the model for any particular 
is Willing n depends upon the assumptions one 
Tegarding to make and the knowledge one has 
For S the properties of the parameter k. 
Pon he in which the pi; values are based 
taindividual replications, the applica- 


tion of Equation 10 and the subsequent solu- 
tion is quite straightforward. To replace intra- 
individual replications with interindividual 
replications it is necessary to assume that all 
of the judges have approximately the same 
capacities for discrimination. Even if the values 
of fy; vary across individuals the model may 
satisfactorily represent the data by yielding 
values of ;; which are averages of the indi- 
vidual threshold values. If each individual 
makes a large number of judgments it is 
possible to evaluate the assumption that the 
different judges have the same discrimination 
threshold. The procedure involves comparing 
the abilities of the solutions to reproduce the 
obtained proportions when a common kj; is 
assumed and when it is not. A chi-square 
goodness-of-fit test to a trinomial distribution 
is required and is described by Glenn and 
David (1960). 

A common problem in political polling, 
market research, and other settings in which 
binary responses are requested, but judgments 
of “equal” or “no difference” occur, is how to 
allocate the “undecided” votes between the 
two stimuli being compared. There are two 
frequently employed methods of resolving this 
problem. The first is to split the “undecided” 
votes equally between the two stimuli; the 
second is to allocate these responses propor- 
tionately by dividing each of the choice pro- 
portions by their sum so that the resulting 
values add to one. The present model produces 
aresult somewhere between these two methods. 
To the extent that the integral of the normal 
difference distribution between 0 and —ky; is 
equal to that between 0 and +k;; the “equal” 
votes will tend to be split 50-50 between S; 
and S;. These integrals or areas will tend to be 


equal whenever either of two conditions holds: 
oportion of S; judgments approaches 


he pr! 

a ‘peumartion of S j judgments. (b) The pro- 
ortion of “equal” judgments is very small 

(i.e. hea is small). It is interesting to note that 
Thurstone (1927a, P. 977) discussed the dis- 
crepancies between these two areas for very 
small deviations from 0 on the difference dis- 
able 1 illustrates the results, for a 

r of hypothetical splits, of the three 
ating the “equal” votes. 
dified Thurstone model is almost 


arly the mo r ; : 
prs its allocation of the ‘ equal’ 


as conservative in 
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TABLE 1 


RESULTS OF AppORTIONING “EQUAL” VoTEs 
BY THREE DIFFERENT METHODS 


Modified 
Data Proportional 50-50 Thurstone 
split split model 

S: S; Equal S: Sj Si S Si S; 

hh h h h hh hh 
40 40 20 50.0 50.0 50.0 50.0 50.0 50.0 
50 30 20 62.5 37.5 60.0 40.0 60.3 397 
55 25 20 68.8 31.2 65.0 35.0 65.5 34.5 
60 20 20 75.0 25.0 70.0 30.0 70.8 29.2 
60 10 30 85.7 14.3 5 77.9 221 
70 20 10 TI 22.3 } 75.2 248 
70 10 20 87.5 12.5 80.0 20.0 81.6 184 


responses as is the technique of distributing 
them equally. Conservative as used here refers 
to the probability of a Type I error associated 
with a test of the difference between the result- 
ing two percentages. For most applications 
where the present model is relevant it would 
appear quite satisfactory to permit respondents 
to employ an equality category and to appor- 
tion such responses equally between the alter- 
native stimuli followed by the standard Case V 
solution, although the use of Equation 10 re- 
quires little extra effort. 
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For any given area of investigation one 
might test the validity of the present model by 
experimentally manipulating the threshold 
parameter k. This could be accomplished by 
employing different degrees of encouragement 
to avoid using the “no difference” category I? 
the instructions to judges or by imposing 
varying time limits on the choice such that ! 
the limit is exceeded the response is recorde 
as an inability to discriminate. Applying the 
model to such data in which & has been varie 
should yield linearly related sets of scale values 
for differing values of k. 
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A descriptive model of con 


from recent brain research literature on t 
several common behavioral tasks as well as 


hases on each of these tasks. Electro- 
and psychopharmacological effects known in the 
d by the model which assumed specified 


parisons to be made between 
within the acquisition to retention p! 
physiological, neurological, 
literature were seen to be ordere 


mutually inhibitory brain systems to un 


release and control of responses. 


The importance of the situational demands 
of “the control and release of behavior” has 
long been recognized by Pavlov (1928) in his 
Studies of neural inhibition and excitation 
Phenomena, Pavlov inferred these mutually 
antagonistic neural events of excitation and 
areition from the behavioral indices of con- 
ditioned reflexes and the conditioned or un- 
Conditioned inhibitions of these reflexes. The 
ot paper will expand and elaborate on 

€ list of behavioral indices and will examine 
fe genous and exogenous changes in speci- 

mutually inhibitory brain substrates as 
r €y relate to these examples of behavior 
elease and control, The descriptive model of 
= behavioral indices will first be formed 
Bt the recent literature of this country 
‘lated to brain research on behavioral release 
le Control systems. This model will then be 
can systematically to neurological, electro- 
Physiological, and drug research. 


RELEASE AND CONTROL 


tro) Beestions of behavioral release and con: 
See oe may be seen in the discussions, = 
ie recent authors in this country. be 
Gan a ot these is the suggestion of the T 
avi of “act inhibition” to explain the e 
ie deficits in delayed response perform 
cme frontal lobe lesions (Stanley 
l S, 1949). Other authors (uations 
Thom Schwartzbaum, Kellicutt, Spieth, & 
S eee 1964) have used the similar con- 
S of “response inhibition and facilita- 


1 P ai 
aeS Paper was prepared duri 
Insch Fellowship MH-21492 
utes of Mental Health. 


ng the tenure under 
from the National 


trol and release functions of behavior was derived 


his topic. The model allowed com- 


derlie the behavioral functions of 


tion” to explain the behavioral deficits ob- 
tained on types of avoidance learning. Thomas 
and Slotnick (1963) and Vanderwolf (1964) 
have used the related constructs of “release 
and inhibition of a freezing mechanism” to 
explain their effects on avoidance performance. 
The suggestion that lesions may “release or 
control motility mechanisms” (Maher & Mc- 
Intyre, 1960) differs by specifically denoting 
the classes of behavior either released or con- 
trolled. Less similar are the constructs used by 
various authors attempting to explain lesion 
drug, or electrophysiological effects on these 
same behavioral tasks. MacAdam (1962), and 
Chalmers and Erickson (1964) have referred 
to “tonic and phasic orienting response mech- 
anisms” as being released or controlled, 
thereby interfering with the performances of 
some but not other tasks. Gerbrandt (1964) 
has used the similar constructs of “control 
and release of attentive shifting mechanisms” 


to explain various behavioral effects produced 


by lesions. 
Specific exar 
control may be 


mples of behavioral release and 
seen in the descriptions of the 
tasks used by McCleary (1961) and others 
(Ellen, Wilson, & Powell, 1964; Webster & 
Voneida, 1964). To obtain evidence of re- 
sponse inhibition, McCleary first trained 
lesioned cats to approach a food station: he 
then punished the cats for such approaches. 
Since the animals were required to control 
their approaches, the punishment phase was 
used as the critical test for inhibition. To test 
for response facilitation on the other hand, 
McCleary trained his cats to release an avoid- 
ance response of running upon a signal presen- 
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tation. He found that these two tasks dissoci- 
ated the effects of two different limbic system 
lesions. Ellen, Wilson, and Powell (1964) de- 
scribed the effects they obtained from septal 
lesions on timing behavior as “the result of 
impairment in the giving up [control] of old 
responses rather than in the learning of new 
response patterns.” Similarly, Webster and 
Voneida (1964) described the effects of hippo- 
campal lesions on reversal learning as a deficit 
in the “altering [control] of a previously 
stabilized habit.” These authors could well 
have been describing the passive avoidance 
task of McCleary (1961) or the conditioned 
inhibition task of Pavlov (1928). From each 
of these descriptions a composite has been 
formed to circumscribe the realm of behaviors 
to be examined in this paper. It is stated as 
follows: 

Discrete brain systems, mutually inhibitory 
in their effects, function on the one hand to 
release highly stabilized responses, and on the 
other hand to allow increases in the stability 
of elicitation of a response by control over 
competing responses of higher stability. 

By discrete and mutually inhibitory sys- 
tems of responding, it is meant that the opera- 
tion of either identifiable neural system is 
inhibitory on the other, and vice versa. As 
used in this paper, a behavior is stabilized 
relative to its susceptibility to interference by 
“external inhibition.” A clue to which be- 
haviors may then fall in the range of “unsta- 
bilized” or “stabilized” may be seen in the 
following quote from Pavlov (1928): 


The more experiments we make with conditioned 
reflexes . . . the more we have proofs showing that 
the process of internal inhibition is more labile (un- 
stable) than the process of conditioned excitation, 
ie, under the influence of extraneous stimuli the 
process of internal inhibition is suppressed more 
easily and more quickly than the process of condi- 
tioned excitation [p. 239]. 


Common examples of stabilized responses may 
be seen then in the performances of condi- 
tioned responses, simultaneous discriminations, 
and conditioned avoidance responses. Common 
examples of unstabilized behaviors are pas- 
sive avoidance responding, reversal learning, 
extinction, and conditioned inhibition re- 
sponding. Although stability may be gen- 
erally said to be greater in the former be- 
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haviors and less in the latter behaviors, an 
important stipulation on the degree of sta- 
bility in these behaviors is that all responses 
used to infer internal inhibition eventually 
phase out into excitation, that is, they become 
stabilized. This warning is based on the find- 
ing by Winnick and Hunt (1951) that 4 
buzzer which inhibited extinction was only 
effective in the early trials of extinction train- 
ing. Winnick and Hunt also found that 4 
conditioned running response was externally 
inhibited to a greater extent in early trials 
than in later trials of acquisition. It might 
then also be expected that the very early 
phase of acquisition of conditioned responses 
would be comparable to other more repre 
sentative unstabilized behaviors (passive 
avoidance, etc.) in the functioning of the pro- 
posed brain systems of control. The electro- 
physiological, lesion, and drug effects will now 
be examined as they relate to the above de- 
scriptive model in order to investigate the 


discreteness of the hypothesized underlying 
brain systems. 


ELECTROPHYSIOLOGICAL RECORDINGS 


Electrophysiological recording is especially 
useful for following the activities of many 
brain substrates simultaneously and temp? 
rally during the progress of learning since 1! 
allows behaviors to develop without interfer” 
ence from treatments. In that the early phas? 
of acquisition is representative of increases I” 
the stability of behaviors (and concurrent con~ 
trol of competing responses), while the late 
phase is characteristic of stabilized response) 
a “phasing out” of some neural activities 
should be expected during the course of learn” 
ing while a “phasing in” of others should také 
place if discrete brain events underlie thesê 
different behavioral functions. In a previous 
examination of the neurological literatur® 
(Gerbrandt, 1964), it was suggested that th? 
first phase might be identified with neu‘ 
operations in the second and third rhine™” 
cephalic systems of Pribram and Kruge 
(1954), while the late phase functions mig” 
be characteristic of the nonspecific projectio” 
system (Nauta & Whitlock, 1954). or. 
sistently, Morrell (1957) has found hipP® 
campal single unit firing to conditioned flick? 
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following during the early stages of training, 
where units of the thalamic nucleus ventralis 
anterioralis (nonspecific) were then inactive. 
As training progressed, hippocampal units 
Were found to drop out while the thalamic 
units increased to a plateau with the stabi- 
lized flicker following. John and Killam 
aes trained an active avoidance response 
0 a flicker conditioned stimulus (CS) while 
recording electroencephalographic (EEG) ac- 
tivity in the hippocampus. They found hippo- 
campal following of the flicker CS during 
carly but not during late training trials. Since 
the temporal superimposition of a negative 
št into the training trials constitutes an un- 
abilized behavioral situation (the negative 
a nditioned response is in competition with a 
tection positive CR which must be con- 
jee ed), it is interesting that John and Kil- 
to Faas found that hippocampal following 
out e negative CS continued after the drop- 
to the positive CS. Grastyan, Lissak, 
ladarasz, and Donhofier (1959) tested for 
'Ppocampal EEG activities during the ac- 
arisition of both classical and defensive 
nditioning, In early trials of either task, 
ere the animals were excited by the CS but 
not yet reliably demonstrated task per- 
(the ance, synchronous slow wave activity 
w rhythm) was recorded in the hippo- 
Spo; Pus, Upon stabilization of conditioned re- 
pha në; however, the slow wave activity was 
ati out into “desynchronization” OF i 
Fah Grastyan et al. also found that dif- 
tea inhibition and extinction procedures 
tyik m a temporary return of the theta 
tive ce accompanying the positive and jega 

illa and, as in the study by Jobn an 
rainin , with continued differential inhibition 
the -8 the theta rhythm first disappeared ri 
tiong stive CS. The classical aversive condi- 
lay ‚8 Tesults of MacAdam (1962) are simi- 
© those of Grastyan et al. and support at 
acrorecording level the single unit findings 
thalami L ( 1957) for the hippocampal and 
ing, ic activities during the course of tram 
por see studies indicate an overwhelming Sup- 
ivo] Or the notion that the hippocampus is 
se a in the phasing out of a control over 
bhasa S responses, and that a consequent 
8 in of release systems is occurring ™ 


foim 
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other brain areas. Unfortunately, the results 
of Adey, Dunlop, and Hendrix (1960) are in 
seeming contradiction with the above reported 
studies. Adey et al. recorded EEG activities 
from several regions of the hippocampus and 
found that the greatest synchrony of slow 
wave activity occurred during the late phase 
of stabilized approach of performance, and 
during the actual motor act of food approach 
within a given trial. These results have since 
been corroborated by Bremner (1964), and by 
Vanderwolf and Heron (1964). Grastyan et 
al. (1959) reported EEG desynchrony during 
this phase and identified a relation between 
orienting responses and slow waves from a 
photographic analysis of the approaches. Adey 
et al. reported no relation of slow wave activ- 
ity to orienting responses from their photo- 
graphic analysis. Several possibilities exist for 
making these data consistent. Grastyan et al. 
reported cases where one electrode in an ani- 
mal would be mostly synchronous where yet 
another was desynchronous, namely, during an 

actual food approach. Although it would seem 

that both a Grastyan et al. and an Adey et al. 

preparation have occurred here in the same 

animal, in both of these studies it is reported 

that the slow wave activities are best recorded 

in the apical dendrites of the pyramidal cells 

of the hippocampus. A second possibility for 

consistency may be seen in other details of the 

Adey et al. findings. During early phases of 

learning, these authors found theta activity 

to proceed temporally from septohypothalamic 
inputs to hippocampal areas CA;, to CA: and 
to exit out the subiculum into the entorhinal 
areas. During later phases of training, the 
temporal course of wave appearances was re- 
versed. If the Grastyan et al. recordings of 
slow waves could be identified with the | 
phase relations of Adey et al., then it might 
be claimed that the late phase slow ‘eee 
were present but masked in the records 0 

he former investigators. Identification of slow 
the os with orienting responses then might be 
wav ly made only during the early phase 
prop zn Finally, differences in task parame- 
aV ight be experimentally manipulated for 
f: to the discrepancy. Grastyan et al. have 
found hippocampal desynchrony to be most 
reliably found in late trials and synchrony in 


early trials of daily sessions. Also, it is not 
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inconceivable that the unitary response 
(Grastyan et al.) and choice response (Adey 
et al.) differences in task parameter may 
produce functional dichotomies (Bitterman, 
1962). 

In spite of the fact that these differences 
have not yet been brought into accord, some 
tentative hypotheses might be made in light 
of other evidence. Petsche and Stumpf (1960) 
have identified the pacemaker of hippocampal 
slow wave activities as lying in the medial 
septal nuclei. Septal lesions are known to ir- 
reversibly eliminate hippocampal slow waves 
(Green & Arduini, 1954). Septal lesions are 
also known to produce specific deficits on 
passive avoidance (McCleary, 1961), on re- 
versal learning (Thompson & Langer, 1963), 
and on the pacing of bar-pressing responses 
(Ellen, Wilson, & Powell, 1964). Septal lesions 
are known to facilitate conditioned avoidance 
responding (Krieckhaus, Simmons, Thomas, 
& Kenyon, 1964; McCleary, 1961). Induc- 
tion of theta activity with injections of physo- 
stigmine (Mayer & Stumpf, 1958), or medial 
septal implants of cholinergic stimulants, will 
block the performance (late phase) of a con- 
ditioned avoidance response (CAR) (Chal- 
mers & Erickson, 1964; Grossman, 1964). 
Scopalamine and atropinic drugs, which block 
the EEG effects from injection of cholinergic 
stimulants (Green, Maxwell, Schindler, & 
Stumpf, 1960), block only the acquisition 
(early phase) and not the performance of 
a CAR (Meyers, Roberts, Riciputi, & Dom- 
ino, 1964), Chalmers and Erickson (1964) 
also reported that atropine blocks the physo- 
stigmine-produced interference with the per- 
formance of a CAR. It would appear that 
the hippocampal theta rhythm, having its 
source in the septal nuclei and proceeding 
through pyramidal cells of the hippocampus 
to the entorhinal area, is necessary for con- 
trol of competing responses while conditioned 
behavior is stabilizing. It would appear also 
that the hippocampus mediates a slow wave 
activity identified with entorhinal, rather than 
septohypothalamic, inputs. This slow wave 
activity, however, is characteristic of late 
learning phase functioning. These hypotheses 

will be reexamined in light of lesion and drug 


studies in the following sections. 
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Lesion EFFECTS 


The notion that discrete and mutually 0- 
hibitory brain systems operate during the 
control and release of stabilized response’ 
suggests that during some task performances 
and not others the brain will have identifiable 
bias characteristics. The endogenous biases 
were seen in many studies discussed in the 
previous section. In this section the conte 
will be with the experimental biases produce 
by various lesions. Since such biases are rele- 
vant to discrete systems, rather than centers, 
of neural activity, knowledge of the projec 
tions and inputs to lesioned nuclei will 
more crucial in interpreting lesion effects than 
the knowledge of which nuclei are damag® 
per se. 

From the discrepancies in electrophysiolo 
ical evidence previously discussed (AdeY be 
al., 1960; Grastyan et al., 1959), it might }, 
expected that contradictory findings will o° 
cur in lesion studies on the hippocampus. 
not too surprizing then that, while Isaacs” 
Douglas, and Moore (1961) found no hipp? 
campal lesion effect on active avoidance, Hun’ 
Diamond, Moore, and Harvey (1957) did i 
such a performance (but not acquisitio” 
effect. Buresova, Bures, Fifkova, Vinogradov; 
and Weiss (1962), on the other hand, report, 
an impairment on an active avoidance acq" 
sition. Utilizing the same type of past! 
avoidance task, Kimble (1965 unpublished) 
reported no hippocampal lesion effect W. se 
Teitelbaum and Milner (1963) found Jarg? 
performance losses. Using a passive avoid 
ance task where a motivated approach | 
punished, Kaada, Rasmussen, and Kvel!™ 
(1962), and Kveim, Setekleiv, and Kaa i 
(1964) found no hippocampal lesion def 
cits while such deficits were reported 
Isaacson and Wickelgren (1962), and bf 
Kimble (1963). Results on hippocampal e 
sions and reversal learning, however, n 
consistent in showing reversal losses we 
out affecting performance of the task We ale 
reversed (Mahut & Cordeau, 1963; Ter 
baum, 1964; Thompson & Langer, pi 
Webster & Voneida, 1964). Although css 
tradictions are expected while using 4 8 jt 
technique such as lesioning, there is veT of 
tle clue within these studies for purpos®f 


Tesolution. In other studies, however, Kimura 
(1958) reported dorsal but not ventral hippo- 
campal lesions to disrupt passive avoidance 
oe Differential projections and ac- 
Wities from the dorsal and ventral hippo- 
campi have been reported (Elul, 1964). Ros- 
vola and Szwarcbart (1964) also reported 
io separation in different lesions, in 
anterior hippocampal lesions produced 
aie delayed alternation deficits than 
oe lesions, while lesions within hippo- 
mpal regions CA; and CA, produced greater 
m than in regions CAs and CAs. The 
dey finding may be related to the report by 
are Ae al. (1960) that regions CA, and CA4 
these respective inputs for late- and early- 
ave he elations. Although region CA4 should 
Osseg een the more relevant site producing 
ionis delayed alternation, from a vertical 
only = approach an attempt at lesioning 
region he CA; region may frequently involve 
ing in CA, as well. It would be more reveal- 
across this regard if animals were lesioned 
from ae fiber tracts entering region CA: 
OF ac € entorhinal cortex and were tested 
Since Auisition and retention of a CAR task. 
abolis entorhinal lesions apparently do not 
(Adey hippocampal slow wave a 
Sept 1 Sunderland, & Dunlop, 1957), while 
Petsch sions do (Green & Arduini, 1954; 
S ould, & Stumpf, 1960), entorhinal damage 
th Not affect acquisition of the CAR. Since 
inal ig Phase functioning is related to entor- 
N rete, Puts, such damage should affect losses 
y €ntion of the CAR. 


Pes Ten eamage to the medial septal 


W 
Vi s a 
effe. activi 


nuclei 


to abolition of hippocampal slow 


s ties, clear predictions of such lesion 
since °° behavioral tasks cannot be made 
epor : S previously noted, Adey et al. (1960) 
adprgs o, Slow wave activity during stabilized 
Note Aches only, while Grastyan et al. (1959) 
conditis OW waves to occur only during the 
ot bene Phase previous to stabilization 
Dace, Mance. Either there are different 
fieq į akers for the slow wave activities identi- 
Case p the two phases of responding, in which 
only -edial septal lesions will produce losses 
ron S acquisition and other situations o! 
Make 28 response control, or it is a pace 
“ther for both phases and is differentiated 

Y activating the hippocampus from 
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both indirect (entorhinal) and direct (fornix) 
septal inputs, or by some chemical or elec- 
trical specificity. Only very tentative conclu- 
sions bearing on these alternatives may be 
drawn from the literature reported thus far. 
Septal lesions have been found, as previously 
reported, to affect passive avoidance (Mc- 
Cleary, 1961), reversal learning (Thompson 
& Langer, 1963), extinction (Schwartzbaum, 
Kellicutt, Spieth, & Thompson, 1964), differ- 
ential inhibition (Schwartzbaum et al., 1964), 
and conditioned inhibition (Brutkowski & 
Mempel, 1961). Many investigators also re- 
ported facilitated performances as a result of 
septal lesioning of animals performing a CAR 
(King, 1958; Krieckhaus, Simmons, Thomas, 
& Kenyon, 1964; McCleary, 1961). Moore 
(1964), however, has reported septal lesions 
to produce deficits in the retention of a CAR. 
The results of Vanderwolf (1964) and of 
Kenyon (1962) were not in accord with his 
findings since they reported decrements in the 
but not the retention of a CAR. 
It must not be concluded, however, that both 
control and release functions are therefore 
mediated by the same septal brain cells; none 
of these studies, other than McCleary (1961), 
tested for both passive and active avoidance 
performances in the same animals. McCleary 
(McCleary, Jones, & Ursin, 1965) has again 
followed up these septal lesion study discrep- 
ancies in such a fashion. In this study, Mc- 
Cleary deliberately placed septal lesions in 
anterior and posterior groups.” He found 
anteriorly placed septal lesions, especially in 
the medial septal nucleus, to produce deficits 
on the passive but not on the active avoidance 
task. Posteriorly placed lesions, however, pro- 


duced no losses on the passive but ge 

deficits on the active avoidance ne spe 

findings are qute an oe Ta a o 
tion studies 0 i 

dege ad anterior septal lesioned sites to br 

differentially projecting to the hippocampt 

the anterior cingulate 


ix fibers, tO 

over fornix 7 te 
rea over the formx longus, and to = 
an . H . = 
mentum via the medial forebrain bundle. Cor 


i i f the precommissural 
ndingly, lesions o i ' 
m produce deficits on reversal learning 
(Thompson & Langer, 1963) and passive 


2R. A McCleary, personal communication, 1964. 
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avoidance responding (Kaada, Rasmussen, & 
Kveim, 1962); lesions of the anterior cingu- 
late cortex produce losses on a passive but not 
an active avoidance task (McCleary, 1961) 
and on a conditioned inhibition task (Brut- 
kowski, 1957); and lesions of the medial 
forebrain bundle produce losses on passive 
avoidance responding (Kaada, Rasmussen, & 
Kveim, 1962) and reversal learning (Thomp- 
son & Langer, 1963). Many of these brain 
areas are known to induce septal-hippocampal 
slow wave activity upon electrical stimulation 
(Green & Arduini, 1954). Powell (1963) also 
found that posterior septal lesions produced 
differential degeneration in the thalamic an- 
teroventral, reticular, and habenular nuclei. 
Consistently, lesions of the anteroventral tha- 
lamic (Thomas & Slotnick, 1962), as well as 
the habenular (Thompson, 1963), nuclei have 
been reported to produce deficits on a CAR 
performance. Gerbrandt (1965) reported no 
effect on learning a passive avoidance task in 
animals lesioned in habenular, as well as 
dorsomedial, thalamic brain areas. When the 
differential projections of lesioned areas within 
the septal nuclei are taken into account, then, 
findings suggest anatomical separations for the 
control and release behavioral functions. Con- 
tinued investigation is needed, however, to 
determine the relation between EEG activities 
as they originate from these areas and the 
functions they serve. 

The remainder of the lesion studies indicat- 
ing production of a neural bias in control and 
release systems have largely been reviewed in 
a previous paper (Gerbrandt, 1964). To re- 
iterate, these studies indicated a circuit, 
roughly comparable to that of Papez (1937), 
functioning to control behavior competing 
with a response to be stabilized. The non- 
specific projection system, on the other hand, 
was implicated in the release of stabilized 
responses. Aside from those lesion studies 
already reported in this paper, several 
relevant studies have appeared since the 

preparation of the previous review. The ex- 
pected findings that mammillary and mam- 
millothalamic tract lesions would not interfere 
with shuttlebox avoidance responding, have 
been supported by the investigations of Dahl, 
Ingram, and Knott (1962), and of Ploog and 
MacLean (1963). Again, the opposite results 


have been found by Thomas, Fry, Fry, Slot: 
nick, and Krieckhaus (1963), and by Kriek 
haus (1964); these authors found losses 
the retention but not the acquisition of shuttle- 
box and one-way active avoidance respon 
ing. A knowledge of fiber degenerations s¢¢ 
also to be crucial in interpreting these rest " 
Krieckhaus (1964) noted that the fibers a 
ning from the lateral mammillary bodies t0 i 
anterodorsal thalamic nuclei may be respon 
ble for these effects. Consistently, the anoa 
dorsal thalamic nucleus sends a heavy r 
jection to the posterior cingulate oi 
(Locke, Angevine, & Yakovlev, 1964), whi 
has been reported to produce losses in ac" i 
avoidance responding (Thomas & Slota g 
1963). In the rat, where strong differen” 
projections from the septum, via the m 
millary bodies, to the anteromedial thaw) 
nuclei have been described (Powell, 19 he 
the mammillary body and tract an duce 
anteromedial thalamic nuclei lesions P!" pd 
reversal (Thompson & Langer, 1963 65) 
passive avoidance learning (Gerbrandt, sei 
deficits. Future investigations are ic) 
needed which will determine whether Sa 
losses may be produced in the same an of 
where no losses are reported on other tas 
opposed function (Dahl, Ingram, & 
1962; Ploog & MacLean, 1963). Until t 
these hypotheses remain ad hoc. e 
The assumption of interactions Þet eft 
brain systems, rather than between cont 
has been illuminated other than by ©” A 
dictory data reported for the same nucletes 
mutual inhibition between two systems iv? i 
not only that a lesion in one syste bot 
produce a bias in the opposite syste™ ye 
that the bias may be negated either bY * 
pletion of function in the first system w 
tation), or by depletion of function by et 
ing, again, or depressing the second 59° gY 
Utilizing the known fact that posterior sie 
late lesions produce losses on a CA i 
Cleary, 1961; Thomas & Slotnick, 1962/4 
is interesting that drive deprivation, vo 
produced EEG arousal in the rat (Hock 0! 
1964), negates the effects of such P®9g3)’ 
cingulate lesions (Thomas & Slotnick; 
Further, anterior cingulate and septa gish 
which alone produce passive avoidancé te 
tion, are negated in this effect by P 
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cingulate lesions (Lubar, 1964; McCleary, 
con oe medial thalamic nuclei lesions 
soit ae: avoidance responding (Thomp- 
(1964 st Vanderwolf, 1963), Vanderwolf 
oe ) ias found that additional septal lesions 
tons i effect. Consistently, he found septal 
the Car One to lessen the acquisition rate on 
Superim even though these lesions, when 
proved posed on medial thalamic lesions, 1m- 
e Ses on the same task. It will 
rugs after the following section that many 
effects could be substituted for the drive 

63) obtained by Thomas and Slotnick 
1964 or the lesion effects of Vanderwolf 

) or Lubar (1964). 


T Druc EFFECTS 
actin Fea ies of neural biases by drug 
tion on ne another dimension of ac- 
ical dat s electrophysiological and neurolog- 
ês, i described to this point. In many 
HA sen investigations promise to clarify 
ee of systems where these systems 
1gg4 Wally or electrically overlap (cf. Fisher, 
Ug elfen Other cases, investigations involving 
the braj Cts provide excellent convergence On 
a relatio, System biases discussed previously 
he (196. to lesion and recording data. Gross- 
line im 4), for example, found that crystal- 
nergic Plants of cholinergic, but not adre- 
neg on imulants will interfere with perform- 
A the a CAR when these implants are placed 
medial septal region of the brain. As 
Sectroer teviously, it is quite interesting that 
highs 2Cephalographic data indicate that 
Oey, S¥8chronous hippocampal slow waves 


a . . : 
(Sailen, 2, result of cholinergic stimulation 


r ptal lesi Stumpf, 1957), and are blocked by 


Prt ons (Mayer & Stumpf 1958), which 
(Ring facilitate. ror ormana on a CAR 
entra] 1958), Using peripheral, rather than 
(t964 Injections, Chalmers and Erickson 
ato Faan that this net cholinergic stimu- 
ie a blocks performance on @ 

Ign (Gre which blocks cholinergic stimula- 
Seq) "teen, Maxwell, Schindler, & Stumpf, 
These” gg ain released CAR responding. 
T aia are substantiated in the similar 
1957 ler findings of Pfeiffer and Jenney 


ming p ieyers, R iciputi, and 
Choy 20 , Roberts, Ricipt)s 
Aatinerp, 964) have additionally found ua 


blocking drugs (atropine 
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scopalamine) disrupted only the acquisition, 
but not the retention, of a CAR. These effects 
were again substantiated in the findings of 
Herz (1959) and of Bignami (1964). Hearst 
(1959) also reported the cholinergic blocker 
scopalamine to disrupt differential responding, 
to produce an inability to withold responding 
between trials, and to produce deficits on 
response extinction. Buresova, Bures, Boh- 
danecky, and Weiss (1964) have found atro- 
pine to affect passive avoidance and extinction 
only when these reactions were not over- 
learned. Chlorpromazine, which apparently 
acts as an adrenergic blocker (Brodie, 1959), 
has been found to facilitate, or at least not to 
affect, performance of a passive avoidance 
(Naess & Rasmussen, 1958; Ray, 1964), and 
to block performance on a CAR (Chalmers & 
Erickson, 1964). Bindra and Mendelson 
(1962) have found, consistent with the notion 
of bias toward a control system, that chlor- 
promazine had greater retarding effects on 
performance where greater overtraining had 
previously occurred on the task. Ampheta- 
mine, which mimics to some degree the 
adrenergic stimulants (Brodie, 1959), has 
been found to differentially interfere with 
the inhibitory component of a discrimination 
(Cardo, 1959) and to facilitate performance 
on a CAR (Carlton & Didamo, 1961). , 
These results tend to indicate that the brain 
systems underlying control and release func- 
tions are biased by cholinergic stimulation and 
adrenergic blockade (Bindra & Mendelson, 
1962; Chalmers & Erickson, 1964; Grossman, 
1964; Naess & Rasmussen, 1958; Ray, 1964), 
and by adrenergic stimulation and cholinergic 
blockade (Buresova, Bures, Bohdanecky, & 
Weiss, 1964; Cardo, 1959; Carlton & Didamo, 
1961; Hearst, 1959; He 1959; mani 
Roberts, Riciputi, & Domino, 1964; Pfeiffer 
ively. Such mutual chol- 
Jenney, 1957), respectively n 2 
inergic-adrenergic inhibition systems have a N 
been reported by Grossman (1964) an, ‘is 
many studies on chemical stimulation of tat 
ing and drinking in rats. Since Grossman re- 
ported interference with a CAR performance 
in the same implant (cholinergic) that stimu- 
ia uch findings on the produc- 


lates drinking, S t od 
tion of consummatory responses might provide 


interesting converging evidence on the ana- 
tomical components of the brain systems 
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underlying the control and release functions 
described in this paper. It will be remembered 
that neurological evidence (Gerbrandt, 1964) 
indicated that a circuit comparable to that 
of Papez (1937) and a frontal to septal to teg- 
mental (Le Gros Clark & Meyer, 1950) path- 
way each seemed to function in increasing 
the likelihood of a response in competition 
with more likely responses. It seems now that 
this system may be biased by cholinergic 
stimulation, or by adrenergic blockade. Cor- 
respondingly, Fisher and Coury (1962) noted 
that drinking behavior is mediated by these 
same characteristically cholinergic Papez and 
medial forebrain circuits. 
While such remarkable support has come 
from independent investigations on the pro- 
duction of drinking, a systematic attack is 
needed to elucidate these relations in the 
same animals. Possible relations between the 
adrenergic systems of eating and the pro- 
posed system for behavioral release are in 
especial need of such clarification. Unlike one 
of the interpretations of Grossman’s (1964) 
data, cholinergic stimulation is proposed in the 
present paper to lead to CAR performance 
decrements not because of competition with 
motivated (drinking) behavior. It is hypothe- 
sized instead that at sites eliciting eating by 
adrenergic stimulation, no such competition 
should result from eating (or drinking). Also, 
Hernandez-Peon, Chavez-Ibarra, Morgane, 
and Iaria (1963) did not report such consum- 
matory effects in the cat, but their findings 
did support the notion of a cholinergic arousal 
inhibition system running from the septal area 
to the tegmentum. Again, cholinergic stimula- 
tion should lead to blockage of the CAR per- 
formance in spite of the lack of competing 
consummatory responses. Finally, those areas 
where anatomical overlap of adrenergic and 
cholinergic systems is known to occur (Gross- 
man, 1962) should be investigated for overlap 
of the proposed control and release functions 
thought to be related to these systems. Such 
investigations should reveal components of 
the systems of concern in this paper which 
have heretofore been masked by dimensions 
of technique. 


CONCLUSIONS 


In this paper, a descriptive model noting 
that discrete brain systems determine the re- 
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lease and control of stabilized responses us 
formed from suggestions and behavioral P* i 
rameters in the existing literature. Many ° 
the types of “contradiction” bereen stud 
in neurology (Moore, 1964 Vander 
1964), electrophysiology (Adey et al, 1 
Grastyan et al., 1959), and drug " 
(Buresova, Bures, Bohdanecky, & 
1964; Chalmers & Erickson, 1964) w T 
solved by this model. Since the mode, 
formed from existing suggestions in the ® ie 
ture, it might only be expected to differ 4 
ducother models by its added generali tye va 
ordering of release and control functio" ts 
behavior. In this sense the existing ™, 
differ by limiting that which is rela 
controlled. Response inhibition gelio! 
(McCleary, 1961; Schwartzbaum, 5 to “l 
Spieth, & Thompson, 1964) fail hy” 
count for the differences which ™ g 
obtained on acquisition and retention a 


e 
et al., 1964; Kenyon, 1962; Meye! je 
1964). As explanatory constructe nis i 
and controls over freezing (TPO mo 


Slotnick, 1963; Vanderwolf, 1964) a "st 
ity (Maher & McIntyre, 1960) nd 
offer clear predictions on acquisitio" : a 
tention effects. These constructs a ‘ait 
faced with cases where deficits are ° (ot 
without changes in freezing or motility d % 
brandt, 1965; Krieckhaus, 1964); oy w 
relatively species- specific to rat and © 
havioral changes. Release and cont. 
orienting (MacAdam, 1962; chalm? ie 
Erickson, 1964) or attentive shifting aii 
brandt, 1964) mechanisms may p° oy 
have the same predictive value as th¢ ate 
set forth in this paper; however, they ve 
present too easily used to settle inte A 
tions in an ad hoc fashion and differ DO 
ing the causal basis for effects on oth 
related effects (epiphenomena?). 52 ‘ely ril 
the present review the model is inte” eas 
valid description of the events of oe dy 
control of behaviors which may be ie oi E 
stability. Perhaps the causal natures oh a 
data will become clarified once 5° 1 Pe i 
scriptive model, although it may ne oh 
present one, is agreed upon as ftt A uf | 
the relevant facts. (For more ae! Pe l 
retical treatments of these data, 5 see 
by Sachs, 1965, and the book by * 
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SELECTIVE ATTENTION IN ANIMAL 
DISCRIMINATION LEARNING? 


N. J. MACKINTOSH 
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Two classes of experiment on the role of attention in discrimination learning 
are reviewed: (a) Investigations of the effect of attention on the amount learned 
about different cues have been interpreted as disproving noncontinuity 
(according to which animals attend to only one cue at a time). The fact that 
animals learn something about a second cue, however, does not prove that 
attention has no effect on learning, and more recent evidence shows that it 
does. (b) Ti animals do not automatically attend to all cues. part of what they 
must lam im order to solve a discrimimation problem is to attend to the 
relevant cue. Experiments om the acquired distinctiveness of cues, transfer 
along a comlimmaurn, and reversal learning provide evidence for the importance 


theory 


oi such classificatory learning. 


The concept of attention has occupied a 
rather humble place in the history of psychol- 
ogy. Seventy-five years ago William James 
(1890) wondered how it could be that “so 
patent a fact as the perpetual presence of 
selective attention has received hardly any 
notice from psychologists of the English em- 
piricist school [p. 402].” Later, it is true, 

| among the structuralists, the related notion 
| of “attensity” played a more important role: 
| but with the rise of behaviorism, attention 
| was once more banished from psychological 
| discourse, and even the Gestalt psychologists 
(less inclined than the early behaviorists to 
prejudge the value of a concept from its intro- 
spective overtones) had little room in their 
system for a selective determinant of percep- 
tion. One of the more valuable features of 
the impact of information theory on psycho- 
logical thinking has been in its insistence on 
the importance of a number of input variables 
in the determination of behavior, and the 
| concept of limited channel capacity, as has 
| been pointed out by Broadbent (1958), in- 
evitably leads to a consideration of the 


1 This paper was written as part of a project on 
Stimulus Analyzing Mechanisms, financed by the 
American Office of Naval Research (Contract 
N62858-3937) and the Nuffield Foundation. I am 
grateful to a number of investigators who have 
allowed me to sce prepublication copies of papers; 
to J. Mackintosh, E. M. Macphail, and N. S. 
Sutherland for their comments on an earlier draft, 
and finally to E. P. Lovejoy and, in particular, to 
N, S. Sutherland who are responsible for many of 
the ideas in this review. 
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importance of selective or filtering process 
in perception. 

Animals (particularly lower animals) h e 
nervous systems of limited size and ee 
fore of limited capacity for processing en 
storing information. Thus they are confron äi 
with the problem of selection. At some s 
they must discard irrelevant or redundant je 
formation so as not to interfere with et 
storage of important information. This m 
of argument would seem to provide a gener s 
rationale for postulating, as Broadbent 4° 
the existence of filtering devices in the mie 
ous system; and if this approach is justifie : 
it is undoubtedly of the first importance 
general behavior theory. To put it at ‘I 
simplest, if animals do not respond to G 
features of their stimulus input, then a sha! 
distinction must be drawn between the phys 
ical stimuli impinging on an animal in a” 
given situation and the effective stimul!® 
which controls the animal’s behavior in th® 
situation. Failure to consider this distincti?” 
might lead to explanations of behavior bei”? 
offered that are at best incomplete and @ 
worst totally misconceived, As Prokasy an 
Hall (1963) have recently pointed out, p 
failure may well invalidate much of the the? 
retical approach to the problem of stimul" 
generalization. e 

Tt may be granted, then, both that hen 
are plausible grounds for postulating a cent" 
mechanism of attention, and that the questi s 
of attention is an important one; it rem 


ave 
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ANIMAL DISCRIMIN: 


t a P y 
0 consider the nature of the experimental 


evi = > š à 
dence bearing on these issues. This review 
makes no att 


of the field; 
Which have in 


empt at an exhaustive coverage 
it is confined to experiments 
in discrinin Teba the role of attention 
his Aten fey learning and, even within 
in Bihan argely concerned with learning 
with i animals (although some work 
larta gi be mentioned). Animal 
topics where igen is iene of the few 
applied ay ai asec pt of aitentivn oa Reece 
Perhaps the Bae bo ad ide ie ol 
ee at eee tee seein 
Problems mal animals solving d iscrimination 
binging er es nt not attend to all stimuli im- 
subsequent] neir receptors, and this suggestion 
issue in i: y provided one of the points at 
and non de controversy between continuity 
latning ee theories of discrimination 
elements is controversy contained several 
tole of Si and above the question of the 
Among oe in discrimination Lectin. 
during em were: (a) hypothesis behavior 


discriminati aaee Da 
tently crimination learning—this has re 
matical form 


been the subject of several mathe- 
ulations (Levine, 1959; Restle, 

“Sie > (b) the rate at which learning occurs 
Usual] e name implies, noncontinuity theories 
idly Y regard learning as occurring very rap- 
964 Perhaps in a single trial (Estes, 1960, 
in dise and (c) the nature of the stimulus 
respond mination learning—whether animals 
ive a a the relationship between the posi- 
Values  PCBative stimuli or to their absolute 


mn Portant as these three issues are in their 
Tt is right, they are not of present concern. 
derive guable that the first two at any rate 
r i regar the 
centra] m what is here regarded as 


at „ Smtention of noncontinuity theory— 
a 


nimals respond selectively—but they 


Barres 
this €rtainly not necessary consequences of 


nee” Tt is difficult to see any logical 
nection between the question of the role 
Attention in discrimination learning and 
nature tion of the absolute or relational 
he EN the stimulus controlling behavior. 
Xue ent review, therefore, is concerned 


Xclug 1 
Tole Sively with the central question of the 


Tw, attention in discrimination learning. 


CUsseq. ad classes of experiment are dis- 
> rst, those designed to investigate 
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the effect of attention on the amount learned 

about any given cue: secondly, those con- 

cerned with the possibility that in order to 
solve a discrimination problem, animals must 
learn both to attend to the relevant cue and 
to establish appropriate choice responses. 
Finally, a number of two-stage theories of 
discrimination learning are briefly discussed, 
and an attempt is made to bring out some 
of their similarities and differences. 


EXPERIMENTS ON THE SELECTIVITY 
or LESENING 

The most obvious implication of the con- 
cept of attention is that animals should not 
learn equally about all the stimuli impinging 
on their receptors. Classically, three types of 
experiment have been generated by this point 
of view: (a) those on the presolution period, 
(b) those on what this reviewer will call 
incidental cue learning, and (c) those on 
single-stimulus pretraining. The plan will be 
to review earlier results, in their light to reject 
one version of noncontinuity theory as origi- 
nally formulated, to outline the elements of 
an alternative noncontinuity position, and, 
finally, to show how this reformulated theory 
is compatible both with early results and with 
more recent experiments. 


The Presolution Period 


There is no reason for dealing at length 
with this set of experiments, since it was un- 
necessary to perform them in order to dis- 
prove an extreme noncontinuity position. The 
rationale for the experiments rested on the 
observed fact that rats, trained by a non- 
correction procedure on a simultaneous visual 
discrimination, tend, while performing at a 
chance level of accuracy, to select a particular 
position consistently; in other words, they 
“adopt a spatial hypothesis” or “display a 
position habit.” The breaking of this position 
habit usually precedes perfect performance by 
a relatively small number of trials. From this 
observation (which was never in dispute), 
Krechevsky (1938) inferred that, while re- 
sponding to one stimulus, the rat “learns 
(wrongly perhaps) something about the sig- 
nificance of this particular stimulus . . . but 
does not learn anything about the ‘correctness’ 
or ‘wrongness’ of the to-be-finally-learned set 
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of discriminanda [p. 111].” It is not certain 
that this statement was ever a fair representa- 
tion of Krechevsky’s views; in the same 
article there are other statements putting 
forward a much more moderate position. 
However, the extreme position implied by 
this statement was in fact accepted as an 
integral part of noncontinuity theory, and in 
the remainder of this section it will occasion- 
ally, for brevity’s sake, be referred to as 
Krecheysky’s position. The particular state- 
ment quoted here seems to rest on two un- 
related assumptions: (a) that consistent 
selection of a particular position indicates that 
the subject is attending to position cues, 
and (b) that attention to Position cues pre- 
cludes the possibility of learning anything 
about other cues, The first assumption can 
perhaps be safely accepted. But there is no 
a priori justification for the second assump- 
tion, and evidence against it is provided by 
two simple observations of the behavior of 
rats learning simultaneous visual discrimina- 
tions. 

1. Mahut (1954) reported that while rats 
were still responding to position (i.e., during 
the presolution period) the latencies of their 
responses to the positive and negative stimuli 
(S+ and S—) began to draw apart. Although 
they selected S+ no more frequently than 
S—, they responded to it significantly faster, 
The writer has also observed this effect.2 
Thus the use of a (more sensitive) latency 
Measure reveals that a preference may be 
established for S+ over S—, when a choice 
measure indicates total absence of learning. 

2. If, while the subjects are consistently re- 
sponding to position, they learn nothing about 
the relevant cue, it follows that their first 
response away from their preferred Position 
should on the average be as often to S— as 
to S+. In fact, of 40 subjects trained by 
the writer on a black-white door discrimina- 
tion in a jumping stand, 30 developed con- 
sistent position habits (selected one position 
for at least 20 consecutive trials), and of 

these 30, only one first broke its position 
habit by selecting S— on its nonpreferred 
side. 

It is clear that rats will both respond 
systematically to position and learn some- 


2 Unpublished observations, 
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thing about the relevant cue at one e 
the same time. It follows that Reece 
second assumption is false and that ee 
of experiments designed to test this ext" 
noncontinuity position was redundant. hee 
In fact, of course, the majority of tible 
experiments have yielded results incor 
with Krechevsky’s point of view. The n 
of these experiments was to comparet Fit 
groups in their rate of learning a dist in 
nation A’-A"* One group was E 
throughout with A’ positive, the a 
(reversal) group was initially traine sitive 
A” positive, but was reversed to A No the 
while still responding at chance (befor! col” 
end of the presolution period). Result ty 
tradicting Krechevsky’s noncontinuity pet 
have been obtained in the following reuni 
ments of this design with rats: Ehren 51) 
(1948, Experiment TI), Gatling Hoe! 
Krechevsky (1938, Group III), McC nd 
and Pratt (1934), Ritchie, Ebeling: “p 
Roth (1950), Spence (1945), and T yal 
experiment with human subjects, n” 
(1952). Four experiments have show p 
sign of relevant cue learning during thé 63) 
solution period: Bower and Trabass0 sp” 
Bradley (1961), Ehrenfreund (1948, Ei 1): 
ment T), and Krechevsky (1938, Group tet 
Bradley used human subjects but, W pe 
of a reversal group, had a group for who afte! 
relevant cue was not introduced until ine 
10 trials. The fact that these subjects 1°% eel 
the problem as fast as a control grouP , {0 


10 trials, but only that the attention 0 
experimental group was attracted to the “ys 
vant cue by its novelty, In Ehrenfre™ d 
experiment with rats, the stimuli were ples 
on the top half of the doors of a ju™ po! 
stand, and it can reasonably be claimed 1 
the absence of presolution-period learning ‘af 


w 

3 Here, and elsewhere in this article, the oe 
notations are used: A, B, etc., are used to as si 
cues, or stimulus dimensions such as bright ne ig 
etc.; A’, A” are used to refer to particular ns hy 
embodying particular values of these dimension pit 
example, A’ black, A” white; if A’ signifies Osi? 
Square and A” a white square, then A., might iy 
a black diamond, that is, a stimulus differ! 


ast) 
A’ along a different dimension), In the P” jo" 


discrimination between A’ and A",” the first § 
is positive, the second negative. 
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due to the animals having failed to look at 
the relevant stimuli. The results, therefore, 
can hardly be adduced in support of non- 
continuity theory. 

There remain Krechevsky’s results with 
rats, and those of Bower and Trabasso with 
college students. Krechevsky used a very dif- 
ficult discrimination, Bower and Trabasso 
either a complex one or a very small number 
of reversed pretraining trials. Although, there- 
fore, as both Bower and Trabasso (1963) 
and Broadbent (1961) have suggested, it is 
Possible that selective attention may only 
Operate in complex situations, it is also, pos- 
sible that these experiments were biased in 
avor of producing negative results, since on 
any theory the amount of learning to be 
expected in these situations would be ex- 
tremely small. This, of course, raises the 
ay difficulty with this experimental de- 
‘8"—the problem of attempting to prove 
oe null hypothesis. There have been seven 
stimuli o8 demonstrations that reversal of 
nifies i Toning the presolution period can sig- 
a Tetard subsequent discrimination 
failine ìn two papers only has evidence 
found to support this generalization been 
es - This evidence is hardly sufficient basis 

hy theory, 

a ese experiments, therefore, could never 
P © Proved noncontinuity theory to be cor- 
the + ONcontinuity theorists set themselves 

ask of demonstrating absence of learning. 

Was always open to an opponent to claim, 
f any particular experiment which found 
Tevens, ference between reversed and non- 
had ed Subjects, that insufficient numbers 

een run or that an insensitive measure 
olia g had been used. Since, as will be 

En “Out later, the experiments do not 
ong oa good evidence that continuity 

st 1S correct, little more needs to be said 

them. 


Inci 
cidental Cue Learning 


mente logic of the presolution-period experi- 
Nats so vas derived from the way in which 
lems ve simultaneous discrimination prob- 
cally į NCe rats tend to respond systemati- 
ing” oe Position in the early stages of train- 
that “2S not entirely implausible to suggest 
sition © are attending predominantly to 

> and less to the relevant cue. Lashley 
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(1942) pointed out that on noncontinuity 
theory it should also be possible to preset 
an animal’s attention by specific experimental 
procedures. An animal that has been trained 
on a size discrimination between a large and 
small circle might be assumed to be attending 
to the cue of size. If now a large triangle is 
substituted for the large circle and training 
is continued, an incidental cue (shape) is 
being added to the primary cue (size). But 
if the subject attends exclusively to the 
primary cue, it should learn nothing about 
the secondary cue. (It is worth noting that 
attention here is not defined in terms of 
systematic responses, since the subject in this 
situation responds just as systematically to 
shape as to size.) 

Although fewer experiments have been per- 
formed on incidental cue learning than on 
the presolution period, their results are hardly 
more favorable to noncontinuity theory as 
originally formulated. It is true that Lashley, 
when he performed an experiment such as 
the one outlined above, claimed to find no 
evidence that incidental learning had oc- 
curred, but he used one insensitive and one 
quite inappropriate measure of learning. In 
his first test of whether the subjects had 
learned about the shape of the stimuli, he 
gave transfer tests with a triangle and a 
circle of equal size and found random re- 
sponding. But, as noted by Sutherland 
(1959), transfer tests with rats are too in- 
sensitive to reveal small amounts of learning, 
since (as happened on this occasion) subjects 
revert to position habits. His second test 
involved opposing the primary and incidental 
cues; the fact that the primary cue con- 
trolled behavior is neither surprising, nor is 
it good evidence for the total ineffectiveness 
of the secondary cue. 


Blum and Blum (1949) performed an 
experiment similar to Lashley’s and found 
some incidental learning to occur, but al- 
though they eliminated the undesirable fea- 
tures of Lashley’s design, they incorporated 
further inadequacies of their own. Rats were 
trained on the large-small circle discrimina- 
tion; once they had learned, a large erect 
triangle was substituted for the large circle 
and a small inverted triangle for the small 
circle, and training was continued. When 
given tests with triangles of equal size, the 
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subjects continued to select the erect triangle. 
Now it is generally accepted that in the 
jumping stand the rat’s behavior is controlled 
more by the bottom half of the stimuli than 
by the top half (Lashley, 1938). There is 
surely, therefore, every reason to believe that 
of two equally sized triangles, the erect (with 
a long base line) will look larger to the rat 
than the inverted (with a point as base). 

Fortunately, there have been two more 
recent and entirely satisfactory experiments— 
those of Bitterman and Coate (1950) and 
Hughes and North (1959). Each of these 
shows beyond any doubt that animals will 
learn something about an incidental cue 
(even if it is only partially, not fully, cor- 
related with reward). The conclusion again 
seems certain: In Lashley’s and Krechevsky’s 
formulations, noncontinuity theory has been 
disproved. 


Single Stimulus Pretraining 


The third experimental situation was origi- 

nally devised by Lashley and Wade (1946). 
If the subject is trained to respond to a 
single stimulus A’, and subsequently learns 
the discriminations A'-A", continuity theory 
will normally predict faster learning of the 
test discrimination if A’ is positive (non- 
reversal) than if A’ is negative (reversal), 
Lashley and Wade denied this. Their argu- 
ment rested (as did Krechevsky’s on the pre- 
solution period) on two unrelated assump- 
tions: (a) that during training with A’ alone, 
a sloping gradient of generalization along the 
A’-A" dimension would only be formed if the 
subject classified A’ as falling at some point 
along the dimension—that is, attended to this 
feature of A’; and (b) that the subject would 
not attend to a particular dimension unless 
given discrimination training between two 
points along the dimension. Hence no gen- 
eralization gradient would be formed after 
single-stimulus pretraining. It is important to 
see that these two assumptions are indepen- 
dent. A demonstration of sloping gradients of 
generalization following single-stimulus pre- 
training might disprove the second assump- 
tion but leave the first intact. 

Lashley and Wade briefly reported a num- 
ber of experiments using variously rats, mon- 
keys, and chimpanzees, in none of which were 
significant differences between reversed and 
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nonreversed groups found (note again the K 
of an experimental design which can on 
show noncontinuity theory to be comeca 
proving the null hypothesis). Two of oie 
rat experiments were repeated by sige 
(1948, 1951), who in each case found on- 
nificant differences in favor of the n y 
reversed groups. Grice’s results are fegurra 
quoted (Kimble, 1961, pp. 369-371: em 
1953, p. 449) as providing decisive ae i 
of Lashley and Wade’s position, so i 
worth showing that his experimental de = 
is invalid. For the sake of illustration a 
need only consider his first experiment, th 
though the same conclusions apply te two 
second. Mackintosh (1965d) trained irde 
groups of rats to approach a white ee 
and avoid a black door: for one group Lae 
reversal) the circle was 8 cm. in diam a 
for the second (reversal) group @ ed 
circle was used. Both groups then Jen the 
to approach the 8-cm. circle and avoid 
5-cm. circle, and the nonreversal 2! 
learned faster than the reversal BroUP di 
when a third and fourth group were ore e 
the third initially to approach the 5-cm. “tne 
the fourth with the 8-cm. circle, and ircle 
were both trained to approach the 5-cm. ri 5 
and avoid the 8-cm. circle, both BrO n 
took nearly three times as long to ag 
this discrimination as did the equivalents |, 
Grice’s two groups. Yet according to Gr a 
analysis, the third group was tested 0” Je 
nonreversal. These results become intelli! 
aS soon as one sees that in original trains 
all four groups were learning to appr 
the brighter of two stimuli (a black door W! 
a white circle) and to avoid the dark. 
stimulus (a black door). Grice’s two gro" 
were then trained to approach the late 
(brighter) of two white circles and there!? 
learned a nonreversal, while the third a 
fourth groups were trained to approach for’ 
smaller (darker) of two circles and there pd 
were reversed. Thus Grice’s (and Lashley “pe 
Wade's) experimental situation involves js 
successive learning of two brightness est 
criminations, and the results are not 4 
of the effect of single-stimulus pretra! 
at all. r of 

There remains, however, a large numbe’ ge 
relevant experiments, which for the first o 
are not unanimously embarrassing t° 


—— 
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continuity theory. Although any experiment 
(eg, those utilizing pigeons and the spectral 
dimension, following Guttman and Kalish, 
1956) which succeeds in demonstrating ortho- 
doxly sloping gradients seems to provide 
Prima facie evidence against Lashley and 
Wade's Position, there have been an appreci- 
able number in which no evidence for a gradi- 
ent of generalization could be obtained: 
(a) Between them, Jensen and Cotton (1961) 
and Thompson (1962) tested for size gen- 
oo in 413 rats without finding any 
ent, (b) „McCain and Garrett (1964) 
iñ tile ec gradient of brightness generalization 
could east (c) Jenkins and Harrison (1960) 
tön cm no evidence of a gradient of audi- 
ileus equency generalization in pigeons. Also 
nt here are those experiments which 
ned bi to show that in order for a neutral 
teinfore to be established as a secondary 
lis Te it must be a discriminative stimu- 
ponds. for secondary reinforcement may 
aq as tests of generalization, de- 
eta a as they do either on showing gen- 
Prefere 10n decrement effects in extinction, or 
results nces in a choice situation; thus, the 
Crock of Ferster (1951), McGuigan and 
may e (1958), and Notterman (1951) 
ate be taken as showing the impor- 
gradi of discrimination training for producing 
> aclents of generalization. 
fiom an and Rudolph (1963) have 
ments y Suggested that even in those experi- 
absen ostensibly showing gradients in the 
Well lg discrimination training, there may 
crimi been an inadvertent element of dis- 
show ation involved. They noted that pigeons 
Sener ey sloping gradients of spectral 
a key pap after being trained to peck at 
Stadie a given wavelength, but fail to show 
Nts of auditory frequency generalization 
ing trained to peck at a key in the 
ce of a continuously sounded tone of 
frequency, They suggested that the 
è Emene an is more likely to introduce 
er, sin of discrimination training than the 
us w ce in the former the positive stimu- 
Wher 


p 


after b 
resen 

Siven 
Ormer 


W only be perceived by the subject 
the subject is engaged in key pecking, 
€refore likely to be rewarded, while in 

re Situation the positive stimulus will 

the subj ved by the subject whether or not 


lect is key pecking. They argued that 


the probability of discrimination training oc- 
curring when a visual stimulus is used would 
vary inversely with the size of the stimulus 
concerned—the larger the stimulus the more 
likely the subject would be to see it when not 
pecking. Their results showed that in accord 
with this suggestion the slope of a generaliza- 
tion gradient after single-stimulus training 
to a given wavelength also varied inversely 
with the area of the stimulus. 

It may be that this ingenious suggestion 
cannot account for all instances of generaliza- 
tion gradients; but it is clear that the occur- 
rence of discrimination training does have 
a marked effect on generalization, and to this 
extent it seems that Lashley and Wade (1946) 
were right. But Lashley and Wade insisted not 
only on the general importance of discrimi- 
nation training, but also that such train- 
ing should be between two points along the 
same dimension as was to be used in generali- 
zation testing. It is true that if such relevant 
discrimination training is given, the change in 
slope of the gradient is very considerable— 
and greater than can reasonably be explained 
in terms of the summation of excitatory and 
inhibitory tendencies (Jenkins & Harrison, 
1962); but in the majority of the studies 
cited above, discrimination has usually been 
between the presence and absence of the 
stimulus. On the other hand, when discrimina- 
tion training is given along an entirely differ- 
ent dimension, it has been shown to be totally 
ineffective in producing generalization gradi- 
ents (MacCaslin, Wodinsky, & Bitterman, 
1952). 

The evidence from generalization experi- 
ments, therefore, is perhaps moderately favor- 
able towards noncontinuity theory. Neverthe- 
less any verdict based on the total evidence 
so far reviewed can only be unfavorable. 
There is no possible doubt that some learning 
about the relevant cue does occur during the 
presolution period; there is equally no doubt 
that rats learn something about a second, in- 
cidental cue when it is paired with a primary 
cue. It is, on the whole, unlikely that the very 
strong claim made by Lashley and Wade 
(1946) about generalization can really be 
substantiated. On the other hand, some of 
the generalization experiments (e.g., Mac- 
Caslin et al., 1952) are by no means easy to 
reconcile with continuity theory, and it looks 
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as if neither theory is in a very satisfactory 
position. This conclusion may seem unwar- 
ranted to those who have regarded the early 
experiments (especially those on the presolu- 
tion period) as good evidence in favor of 
continuity theory. It is the thesis of the next 
section that of not one of the experiments so 
far mentioned can this be said. 


REFORMULATION OF NONCONTINUITY 
THEORY AND SOME NEW EXPERIMENTAL 
EVIDENCE 


The early experiments may have disproved 
Lashley and Krechevsky’s noncontinuity 
theory; but this would only mean that they 
have thereby proved continuity theory, if the 
two theories are not only mutually incom- 
patible, but also cumulatively exhaustive. 
This they clearly are not. A crude representa- 
tion of the two positions would be that non- 
continuity theory states that animals attend 
to, and therefore learn about only one cue at 
a time, while continuity theory states that 
animals learn equally about all cues imping- 
ing on their receptors. Stated thus it is obvi- 
ous that there are alternative positions; for 
example, animals might simply attend to some 
cues more than to others, so that if the sub- 
ject is attending predominantly to Cue 4, it 
will learn more about Cue A than about Cue 
B—and less about Cue B than it would have 
done had it not been attending to Cue A. But 
there is no need to insist that the subject 
either learns everything or learns nothing 
about a given cue. 

Although in an empirical sense this may 
appear to be a compromise position between 
the two extreme poles of continuity and non- 
continuity theories as categorized above, the 
theoretical position that emerges clearly has 
more in common with noncontinuity theory 
than it has with continuity theory: Emphasis 
is still placed on a central process of attention 
which determines (although not now in an 
all-or-none manner) which stimuli out of all 
those impinging on it will prepotently control 
the animal’s behavior. In fact, as already men- 
tioned, Krechevsky (1938) himself, in the 
same article as that from which the extreme 
noncontinuity statement quoted above was 
taken, specifically envisaged this possibility: 

A ight be reacting to an “irrelevant” set of 
Saad, as well as to the relevant set, and 
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would thus be learning something [italics added] 
about the final solution even while seeming to be 
carrying out an “irrelevant” hypothesis. . . . HON 
ever... since, during the so-called presolutio 
period, the animal does spend some time responding 
to “irrelevant” discriminanda, the residual neuro- 
logical effect of a “rewarded” or a “punished Ji 
sponse is not the same [italics added] during ing 
period as during the period the animal is a 
attention to the relevant discriminanda [pp. 108-109): 


This position, therefore, which we may al 
a modified noncontinuity theory, states simp! y 
this: that animals do not classify their sim 
lus input with equal effectiveness in all po 
sible ways at once, and it should therefore i 
possible to influence what an animal ae 
to by appropriate training procedures. ie 
experiments so far reviewed did not prov! he 
a test of this position. The finding that ssi? 
mals learn something about the relevant e 
during the presolution period shows that } 
this situation they do not classify their mpu 
in only one way at a time; but (as pointe" 
out by Sutherland, 1959) it does not answe 
the question whether animals responding 
systematically (i.e, attending) to posity 
learn less about the relevant cue than we 
animals. None of the early experiments, T 
fact, tested the possibility that attention mig 
have some effect on learning; they have oe 
been concerned with investgating whether ot 
mals either attend to a given cue, or lea" 
nothing at all about it. Thus the results a 
these experiments do not contradict expect? 
tions of modified noncontinuity theory, 2” 
by the same token do not provide evidence ! 
Support of continuity theory. For accordi” 
to continuity theory, not only should anim 
learn something about, for example, visu? 
cues when attending to spatial cues, they 
should learn as much about the visual cues 
they would have done had they not been 2” 
tending to the spatial cues. According to com 
tinuity theory there is no such thing 4° 
mechanism of attention over and above 
peripheral orienting responses, no distinct! 
between primary and incidental cues, and ” 
possibility, therefore, of demonstrating 
difference in the amount learned about a aes 
cue depending on the direction of the anim@ 
attention, ri- 

It is clear that a different type of expe t 
mental design is called for, one that will 


: for 
for differences in the degree of learning 9” 
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mae primary and incidental cues, or dis- 
mad whether animals classify their input 
mae ely in all ways at once. The 
aa tte E theory implies that the 
ce ` y an animal is to classify its input in 
it in Bi the less likely it will be to classify 
Ea a The most direct test of this 
ination on is to train animals on a discrim- 
a whit yii two relevant cues, for example, 
vertical horizontal rectangle versus a black 
criterio rectangle, and once they have reached 
much th to use transfer tests to discover how 
rately er, have learned about each cue sepa- 
any pa ae hypothesis predicts that the more 
the tticular subject has learned about one 
ave le; ues (e.g., brightness), the less it will 
is oa about the other (orientation). 
onda. that there should be a negative 
n One ion between individual subject’s scores 
is ba, and their score on the other, and 
lan iction has been confirmed (Suther- 
as re Mackintosh, 1964). Reynolds (1961) 
oin aed results from two pigeons that 
nim: ihe same direction. 
S e als do not, therefore, learn about all 
fa and this suggests that it should 
tion Bo le to manipulate an animal’s atten- 
about if, to affect how much it will learn 
perine rent cues, Consider the following 
60 trig ental situation: Rats were trained for 
ertia, this same white horizontal black 
Mount 1 iscrimination, and tested for hare 
a ere about orientation. Pretraine 
5 ea brightness discrimination (ié; 
ĉas to brightness cues), orientation be- 
animals econdary, incidental cue, and although 
thing S Siven this pretraining learned soni- 
adout orientation, they learned signifi- 
(Mag,; S than animals not so pretrained 
Yer o ne in press), Similar results have 
kinto med with octopuses by Sutherland, 
With ù sh, and Mackintosh (1965), and 
Wear beings by Eckstrand and Wick- 


Cue: 


es ja 
d S * experiments are most directly related 
ha Problem originally raised by Lashley 
his experiment on incidental cue 


the a relevant to 
eem 
y also sS 


inuity theory in effect argues that 
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attention will not influence the amount 
learned about a given cue; these experiments 
show precisely that it is possible to preset an 
animal’s attention, and thereby to influence 
the amount learned about various cues. It 
turns out to be possible to demonstrate the 
equivalent effect in the area of single-stimulus 
pretraining. 

Lashley and Wade’s (1946) account of 
generalization seemed to rest on two as- 
sumptions: the necessity of appropriate classi- 
fication for the subsequent appearance of a 
gradient of generalization, and the necessity 
of appropriate discrimination training for the 
occurrence of such classification. The experi- 
ments reviewed in the previous section were 
entirely concerned with the second of these 
assumptions, but as was pointed out the two 
assumptions are in fact independent, and it is 
of equal interest to test the first. When rats 
were trained to jump to a single rectangle of 
given orientation and brightness, they showed 
gradients of generalization along both orien- 
tation and brightness dimensions (Mackin- 
tosh, 1965a). They therefore classified a single 
stimulus along several dimensions at once (see 
also, Butter, 1963), a result that can only 
with difficulty be reconciled with Lashley and 
Wade’s position. However, pretraining sub- 
jects on a successive brightness discrimination 
in the above situation not only oar a 
significantly steeper brightness a om 
SS jent, but also a significantly iss Srp 
o i the orientation dimension 
gradient along z 
@ sentation pretraining had, precisely the 
(orien e effect). Thus rats trained to classify 
along one dimension are less likely wa 
along another than rats given no Sp 


ing. 
Further Variables Affecting Attention 

provide direct support 
d noncontinuity thi 


lly direct 1 
alee ean 5 ne Having established 


ae aan ed about any cue can be 
the next step would 
te what variables in- 
the range of attention. 
if successful, would 
idence against continuity 
redict that incidental 


i ust p 
— fluenced by those fac- 


theory; ] 
learning will only be m 
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tors such as number of trials, or delay of 
reward, which are important in any learning. 
This point was first recognized by Bruner, 
Matter, and Papanek (1955). They trained 
rats under high or low drive, for 30 or 100 
trials on a brightness discrimination in a 
4-unit maze, then gave 20 trials with a spatial 
cue also made relevant, and finally tested for 
the amount learned about this incidental spa- 
tial cue. They found that both increases in 
drive level and the amount of prior training 
decreased incidental cue learning. 

Two further variables hay 
identified. Mackintosh (in p! 
on an orientation discrimination between a 
gray horizontal and a gray vertical rectangle. 
After learning, the brightness of one rec 


e been definitely 
ress) trained rats 


tangle 
was changed to black, of the other to white; 
training was continued with this incidental 


factors had a significant effect: (a) 
the difficulty of the original orientation dis- 


crimination, the more the subjects picked u 
about brightness; (b) If the rec à 


The greater 


problem difficulty 
Terrace (1963): Pi 
color discrimination; 


errors on the orientat 
if the color difference was abolished a 
a number of errors followed. Gradual fading 
out of the color cue would correspond to mak- 
ing the color discrimination more difficult, 
At least two experiments on single 
pretraining have shown that probl 
culty is an important determinant of 
of generalization gradients along a 
dimension. MacCaslin et al. (1952) and 
Perkins, Hershberger, and Weyant (1959) 
have tested for generalization between A’ and 
A,, after discrimination training between 4’ 
and A”. Increasing the difficulty of the A’-4" 
discrimination produced a significantly steeper 
gradient of generalization between A’ and A,,. 
Such results imply that the greater the diffi- 


bruptly, 


-stimulus 
em diff- 
the slope 
different 
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culty of the relevant discrimination, the n 
likely subjects are to attend to other, m 
vant features of the stimuli, and therefore d 
more likely they are to classify the St a 
along other dimensions (including that R 
quently utilized for the generalization test i 
The explanation of the results revieweoi 
this section has not been fully worked r 
There has been a temptation to state (Ea h 
brook, 1959) merely that such and SA 
factor “narrows the range of cue utilizat! 0 
or that under certain circumstances rats ich 
not “actively inspect” the stimuli; any tive: 
statements, of course, are merely descrip 


2% af 
———— 
fa 


EXPERIMENTS ON Twwo-STAGE 
LEARNING MODELS 


. „ning 
The Importance of Classificatory Lear"! 


ibt 

At this stage there should be little = ut 
that animals do not classify the stimulus a ec 
equally in all ways at once. They react $ by 
tively, and this selectivity can be affecte t0 
various training procedures, But this see" al 
have an important implication, for if a ut 
do not automatically classify their arel 
simultaneously in all possible ways, it 15 A is 
possible that an animal confronted with = Jas 
crimination problem will not necessarily iste” 
sify the stimuli appropriately; it might reS ign 
the position of the stimuli, their orien!” js 
and their shape, when in fact the proble" 
a brightness discrimination, From this it as? 
lows that part of what a subject must sel! 
i ation is preci 
mulus dimen or 
RE 
ention, Even ‘al 
nd to the rele 
of discrimin® pe 


ending must still inct te 
to some high value, in order that acc" w 
performance should occur, Without being r i! 
mitted to an extreme noncontinuity theory i 
remains possible to claim that part of W ras 


Subject must learn ig the appropriate © 

fication, fo! 
Ts it possible then to provide evidenc? nf? 
e occurrence of such classificatory lea” 90 
Can one experiment 
components į 


cl 


th 


ally separate out ft 
n discrimination learning? pil 
asses of experiment have been devised 
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attempt to do just this: those on (a) the 
acquired distinctiveness of cues, (b) transfer 
along a continuum, and (c) reversal learning. 
sh aa case the credit for seeing their im- 
factory poe for performing the first satis- 
wt thee eee belongs to Lawrence. One 
Ran the ‘as (that of reversal learning) has 
Yacer aie og of an enormous volume of 
a ion arch, and an attempt will be made 
some apparent contradictions. 


Th ; Pa 
© Acquired Distinctiveness of Cues 


i on performance on a discrimina- 
ing to an AE nds on two processes (learn- 
then e i learning choice responses), 
epend 1 of learning a given problem will 
t wa the amount of learning of each 
of each ha oceng (thus equally on how much 
he ne = already occurred and will transfer 
tS ioien 

Ponses to othesi, animals cannot learn re- 
ee stimuli unless they are also attend- 
wh Ose stimuli, but there is no reason 
brightnes, should not learn to attend to a 
tesponses cue, without learning the specific 
Settings ee for solving the normal 
Siaa brightness discrimination. Thus 
lar Stee is trained to attend to a partic- 
yp us dimension, by learning to make 
the ne of response to stimuli varying along 
Secon, ension, and is then confronted with a 
S th NE where the relevant dimension 
a the type of response to be 
m sh ifferent, solution of the second 

La Wren, se be facilitated. 
qire ta first experiment on the 
tect est rt S of cues provides a 
twad on a this hypothesis: Subjects were 
®en y’ ana re discrimination be- 
a gp ach 4’ a Gie., they were trained to 
it UCcessiye is avoid A pE they then learned 
oae stimulus ination (i.e., to turn right 
ieS WaS Prese Was present, and left if an- 
Sup; UCCessive a the relevant stimuli of 
Wey Sets learned 'scrimination were A'-A”, the 
i erent far much faster than if there 
Rth Were et evant stimuli, or than if 
ing E response r However, simply learn- 
tarni Could ot approaching A and avoid- 
NE the ave been of no assistance in 
oe discrimination. The 
h mon to the two discrimina- 


a 
they shared the same relevant 
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cue, and the positive transfer must have been 
due to the subjects’ having learned to attend 
to this cue, and having transferred this learn- 
ing between problems. 

In Lawrence’s (1950) second experiment, 
animals were trained on a successive discrim- 
ination, with A'-A” the relevant dimension, 
and B'-B” irrelevant. They then learned the 
combined simultaneous discrimination of A'B’ 
versus A”B” and were finally given tests to 
determine the basis of their solution of the 
simultaneous discrimination. The tests estab- 
lished that the relevant successive cue was 
more effective than the irrelevant one. 

It is clear, therefore, that irrespective of 
the responses required transfer can occur be- 
tween discrimination problems on the basis 
of the relevant stimulus dimension. Learning 
to attend to a particular cue in one situation 
will transfer to another situation. Basically 
similar results have been obtained with rats 
by Mackintosh (1964) and Shepp and Eimas 
(1964); and with college students by, among 
others, Eckstrand and Wickens (1954). 

Conversely, it ought to be possible to train 
animals to ignore a particular cue, and such 
training would be expected to retard the 
learning of a discrimination with that cue rele- 
vant. One possible way of training a subject 
not to attend to a particular dimension might 
be consistently to reward both values of that 
dimension shown to the animal, that is, to 
provide nondifferential reward on the dimen- 
sion. (This possibility is implied by the sug- 
gestion put forward by Sutherland [1964] 
and by Wyckoff [1952] that what “switches 
in an analyzer” or “increases the probability 
of observing responses” is differential rein- 
forcement of different stimuli.) If this is so, 
then such a procedure should retard subse- 
quent discrimination learning, and results 
obtained by Siegel (1945) support this sug- 
gestion. However, later work by Bitterman, 
Calvin, and Elam (1953) and Bitterman and 
Elam (1954) seemed to show that the effect of 
nondifferential reward was even more harmful 
if given on a set of stimuli different from the 
test set. Although this effect has not been con- 
firmed in other experiments (neither Billings- 
ley, Feddersen & Bitterman, 1954, nor Calvin, 
1953, found any difference), it does seem that 
if one wishes to claim that training to ignore 
a particular dimension will retard subsequent 
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learning of that dimension, then one cannot 
also claim that the nondifferential reward 
technique is one way of training animals to 
ignore a dimension. 

It remains possible that other ways of 
training animals to ignore a dimension might 
be more efficacious, and one obvious way 
would be to train animals with the dimension 
in question irrelevant (each stimulus equally 
often rewarded and not rewarded). An experi- 
ment by Goodwin and Lawrence (1955) 
showed fairly directly that subjects will stop 
attending to an irrelevant cue: Rats were 
trained on one discrimination, then on a 
second with the cue of the first present but 
irrelevant, and were finally retrained on the 
first problem either with the stimuli retaining 
their original signs, or with the stimuli re- 
versed. Training on the second problem with 
the cue of the first present but irrelevant, 
should, if the subject continues to attend to 
that cue, tend to equalize the habit strength 
to each of the stimuli, and therefore eliminate 
any difference in the final stage between 
learning either the reversal or the nonreversal 
of the original problem. Goodwin and Law- 
rence found no such effect, and concluded that 
in the second stage subjects simply stopped 
attending to the original cue, and thus failed 
to equalize habit strength to the original S + 
and S —. Although this conclusion has been 
disputed by Goodrich, Ross, and Wagner 
(1961) and by Ross (1962, 1963), Mackin- 
tosh (1963c) has obtained results which sup- 
port Goodwin and Lawrence’s analysis. 
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white irrelevant, one with no brightness og 
present, and then trained them on the black- 
white discrimination. The first group mad 
15.08 errors, the second 12.92 errors during 8 
test trials—a difference that fell short of sig- 
nificance. However, the difference was in t 
predicted direction, and it seems possible on 
it was underestimated, since the horizonta i 
vertical problem was actually learned far 
by the group with black-white irreleva ti 
This is a very improbable result since yer a 
and Bitterman (1953), for example, wen 
shown that the presence of an irrelevant K 
can significantly retard discrimination lea!” 4 
ing; thus it is probable that in Jeeves auf 
North’s experiment the group trained with af 
irrelevant cue contained faster learners, ne 
therefore, that had the two groups b ue 
matched in the first place, the irrelevant “0k 
group would have been significantly infer 
to the control group in the final test. vie 
In summary, then, there is clear-cut 4 
dence that pretraining animals to attend t 
given dimension will benefit learning ” 
discrimination involving that dimension- e 
respective of the relationship betwee? ere 
responses required in the two problems. T bis 
is some suggestion that the converse of ail 
is also true: that it is possible to pret nd 
animals to ignore a particular dimension, a 
that such Pretraining retards learning °, yt 
isctimination involving that dimension; eat 
the evidence on this last point is by no ™ e 
satisfactory, and there is need for furthet 


search, 
There is independent evidence, therefore, 
that subjects trained with a cue irrelevant Transfer al aCouta 
learn to ignore that cue; thus it should also i piri X g” 
lhe possiile to @emonsttuatte that twamine with _ im the previous Section, the argument w 
a particular cue irrelevant retards subsequent in effect, that r 


learning with the cue relevant. There is evi- 
dence from Calvin’s (1953) experiment men- 
tioned above to support this: After receiving 
16 days of nondifferential reward on hori- 
zontal-vertical striations, subjects learned 
the striation discrimination significantly faster 
than after receiving an equal number of non- 
differential trials of which one in six was un- 
rewarded (i.e., after pretraining with hori- 
zontal-vertical irrelevant). Jeeves and North 
(1956) claimed to be unable to find such an 
effect. They trained two groups on a hori- 
zontal-vertical discrimination, one with black- 


ate of learning should er 
previously established P” aot 
ct’s attending to the rele pp 
cue. In this section, a similar argument © yp 
plied to accuracy of performance. For 4 “cy 
Ject to perform with a high degree of ac“ ci 
on any discrimination, the present hypo% pig 
States that the subject must both have # 
probability of attending to the releva” 
and a high probability of selecting S + ga 
that it is attending. Conversely, inac® ot) 
performance might be due to either (OF ka 
of these probabilities being substantia 

low unity, 


fluenced by the 
bility of a subje 
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xy. Consider an animal trained on a discrimina- 


tion between two neighboring shades of gray. 
Tom the close similarity of S + and S — two 
ings are likely to follow: (a) There will be 
me overlap in the central representation of 
we S—, so that even if the animal 
will tives the relevant cue on every trial, it 
cisely E r attain 100% accuracy; (b) Pre- 
X een mistakes will still be made 
will sg animal is attending appropriately, it 
animal be to attend consistently. Thus an 
pe ae orming at a low level of accuracy 
Hibecang l cult discrimination does so both 
] relevant lt does not consistently attend to the 
attendi cue and because, even when it is 
tinguish the stimuli are sometimes indis- 
Sons aa le. Removal of either of these rea- 
heals should improve accuracy. 
Consistent accuracy due to attending more 
ing the 5 y should be achieved by pretrain- 
the sam ubject on an easier discrimination in 
S e dimension until a high level of 
fs ditis e attained, and then approaching 
teasing mi discrimination by gradually de- 
Stimuli E e difference between the training 
the, two. arly experiments by Volkmann on 
1890) Point threshold (cited by James, 
discrimi Y Pavlov (1927) on circle-ellipse 
(193g Nation by a dog, and by Lashley 
n a shape discrimination in rats, 
t in the predicted results do occur, 
st Seas experiment (1952) provided 
s A AN results on what he called 
3 sien a continuum.” The basic re- 
d od (19595 been repeated by Baker and 
Scrimi ) with human adults and a pitch 


So; 


q Nation, and se ound amet 
(Gray macaa tania Bose North 
N With , 

a > and Mees? and Sutherland, Mackin- 


ki 


13 


an 


showed immediate and signiñcant increases in 
accuracy. 


Reversal Learning 


If an animal is trained to approach Stimu- 
lus A’ and to avoid A”, and is then trained on 
the reversal of this problem, that is, to ap- 
proach A” and avoid A’, then a single-stage 
continuity theory of discrimination learning 
predicts that prior A’-4” training can only 
hinder A’-d’ learning. For the type of two- 
stage model considered here, however, the po- 
sition is more complicated, but for that very 
reason more interesting. For on this account, 
although the choice responses required in 
original learning and reversal are antagonistic, 
the relevant cue remains the same throughout, 
and therefore the attention established by 
original training is entirely appropriate for 
the reversal learning. 

From this, it does not follow that reversal 
learning is bound to be very easy, let alone 
that rate of learning A”-A’ should actually be 
increased by prior training on A’-A”. Although 
such a claim seems to be made by H. H. 
Kendler and T. S. Kendler (1962), it is surely 
fallacious. At the beginning of reversal an 
animal is fairly consistently unrewarded; on 
a two-stage model this will lead to extinction 
of both attention and choice responses. Li the 
animal ceases to attend to the relevant cue 
before it has extinguished its original choice 
responses, then reversal learning can only be 
slow and laborious, since both attention and 
choice responses must be built up de novo. 
If, om the other hand, the oviginall choice: 

ESpOMses awe extimeershed while the subject 

is still attending to the relevant cue, then it 

has only to acquire newly appropriate choice 


Ri Using intosh (1963) with octopuses, 
à 955 k ape discriminations. Lawrence 
“tho a a the difficulties confronting 
aj Inuity theory in att i 
i © results. y In attempting to 


EAR i 
nines to show that in a difficult dis- 
gts failu , accuracy also suffers from a sub- 
” a distinguish between S -+ and 
Gal en attending appropriately. In 
and et al. experiment already 
nee trained from the outset on 
TE 'scrimination were finally given 
Porty With the easier stimuli; without 
mity for further learning, they 


responses, and the reversal is learned easily. 
At first sight, it does not seem as if a two- 
stage model must necessarily predict one of 
these alternatives rather than the other; thus 
there is no need for it to predict that reversal 
learning should be relatively easy. It is clear, 
however, that one definite prediction can be 
made: If attention extinguishes before choice 
responses, reversal will be slower than if 
choice responses extinguish before attention, 
and any procedure that increases the proba- 
bility of attending without equally increasing 
response strength, will facilitate reversal. 
Once again it is Lawrence (1950) who 
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seems to have been the first to recognize the 
significance of reversal learning. He trained 
subjects on a successive discrimination with 
Cue 4 relevant and Cue B irrelevant, then on 
a simultaneous discrimination in which both 
were relevant, and finally on either A or B in 
isolation. In this last stage, half the subjects 
were trained with the same stimulus Positive 
as had been positive in the Previous stage, 
while the other half were trained on the re- 
versal. As has already been noted, the non- 
reversal was learned faster on Cue 4 than on 
Cue B, implying that subjects had learned the 
combined discrimination largely in terms of 
Cue A. What was more surprisi 


ing in rigor; at least, how 
theory seems capable of pr 
tion, and this į i 


ever, a two-stage 
oviding an explana- 


The Overtraining Reversal Effect. If pre- 
training with Cue A relevant and Cue B irrele- 
vant is sufficient to ensure that attention to 
A is less likely to extinguish before choice 
responses than is attention to B, it seems 


reaching criterion, and are then reversed, speed 
of reversal learning js 


fits reversal learning, Shortly afterwards the 
results were confirmed by Pubols (1956) and 
by Capaldi and Stevenson (1957). 

Both Reid and Pubols attempted to explain 
their results in terms logically similar to those 
employed the 
concept of the orienting or discriminating re- 
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sponse instead of that of attention). W 
training, they assumed, ensured that su : 
would continue to look at or orientate tow 
the relevant stimuli throughout oi 
whereas nonovertrained subjects would 
tinguish any tendency to look at the n 
stimuli and would “run blind.” Stated ral 
of attention, this explanation runs as {0 pilit 
If overtraining increases the próba ai 
of subjects attending appropriately Wi i 
Causing a corresponding increase mM bie 
response strength, then overtrained a 
will continue to attend to the rele 3 
during reversal and will only have s i 
tinguish their original choice response 
acquire new ones; while nonovertrainen, 
jects will cease to attend to the relevan pid 
attend to other irrelevant cues (during prel 
time they will learn little about the °° at 
tion of the relevant cue with rewa the ® 
hence will take a long time to learn „ini 
versal. This explanation of the overta ger 
reversal effect was first suggested by aq mot 
land (1959) and has since been stated ™ | 
formally by Lovejoy (1965). ; 
At least two other major categor! 
planation haye been advanced for the eve 
training reversal effect, Capaldi and igh 
Son (1957) suggested that overtraining © 4 
benefit reversal by reducing resistan ol! 
extinction. Such a Suggestion could supP iy 
ollow either from the discrimination ait a 
SIS of extinction (advanced by Cap? gó’ 
Stevenson, 1957, and also by Guttman, oil! 
or from a Motivational (frustration) 4 ' 
of extinction (Amsel, 1962; Birch, 19? sol! 
for yet other reasons (D’Amato & J*° 
1960, 1961), F 
Alternatively, overtraining might Pori 
reversal by reducing differential 1% o 
tendencies to irrelevant ¢ j 
tion). This Suggestion 
by Spence (cited by 
Since been propounded rather more 2°" W 


fe 
es Ot 


ause any bias © pef (| 
rather than ano gei 
he excitatory të g "ul 
elicited by each position, and increas! on 
i which each por att y 
rewarded and not 1e% y? 
would gradually equate these excitat™ 
ues. 
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Set G be noticed that neither of these 
fend ions of the effect of overtraining on 
(1950) is able to account for Lawrence's 
TA eee results. In Lawrence’s experi- 
bp ant Istinction between the two groups 

had ake a amount of training that each 

versal rie but in its relevance to the re- 

Seti z F and „this notion of relevance 
Be Ry ie entirely outside the scope of 

sarily in nee. This does not neces- 

vided o that the explanation they pro- 
ina ere e overtraining reversal effect is 

Bean aia only that the attention theory 

owever Ti wider range of results. In fact, 
against Vi ere: is overwhelming evidence 
training ne extinction account of the over- 

5 reversal effect, and quite adequate 


vider 

ne ` 5 

adantars against the irrelevant response 
Ptation account, 


© maj Ste 5 
on Major objection to an explanation of 


faste raining reversal effect, in terms of 
that Sic ere following overtraining, is 
D accur ; account does not even provide 
data, ate description of reversal learning 
` tO re waPitically it is presumably justifiable 
Parts. ; reversal learning as comprising two 
the fone extinction of the tendency to select 
tendency y S+, and (b) acquisition of a 
4 xtinction wee the new S +. Stage (a) is 
Onti uin n phase and might be regarded as 
than 5 eS as long as the subject scores less 
Yih an oe Correct, while Stage (b) starts as 
x Scriptive pect scores 50% or better. At 
Cou impli evel, therefore, the extinction 
evig sal by al that overtraining facilitates 
e aa aoe Stage (a). There is no 

s is er to support this claim. 
tha na “ arisen because of the undoubted 
She reduce mien increased training does 
Mage da: Ts Stance to extinction (Hill & 
Sigg SH, 19 ee Ison & Cook, 1964: 
inthe’ Wagnes oo & Stimmel, 1960; 
(194 experiment en Wagner, 1963). Thus 
). Where di OF irch, Tson, and Sperling 
train Of running Scrimination was defined in 
Ng reversal TER in a runway, the over- 

lly Matick T obtained can be eco- 
IS dogs eh en um terms of extinction. 
eo os eo ee 
6 tater. situation will reduce 

it Pr ction of choice responses, 
ction Al not; it increases resistance 
> All Investigators of reversal 


w 
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learning who have actually published data on 
their subjects’ tendency to continue selecting 
their former S+ during the frst half of 
reversal learning (whatever precise measure 
of this tendency they have used) have uni- 


formly found that overtraining increases 
resistance to extinction (Komaki, 1961; 
Mackintosh, 1962, 1963b, 1963c, 1965b; 


Reid, 1953). Overtraining facilitates reversal 
of a simultaneous visual discrimination not 
because of, but in spite of its effect on extinc- 
tion. 

It is true that in several of these cases, the 
measure of choice response extinction was the 
number of initial consecutive errors at the 
outset of reversal, and that one might argue 
(as does, for example, Sperling, 1965) that 
greater initial resistance to extinction follow- 
ing overtraining might be followed by subse- 
quently more rapid extinction. However, 
Mackintosh (1963c) showed that overtrain- 
ing increased the number of trials taken to 
reach an extinction criterion of equal choice 
of S + and S — over 10 trials. In the light of 
this evidence, any further suggestion that 
overtraining facilitates reversal by producing 
faster extinction must be using the term 
“extinction” in a way quite different from 
that in which it is used here. 

Much of the evidence to be discussed be- 
low is also incompatible with the extinction 
account of the overtraining reversal effect: 
for example, experiments on the effects of 
irrelevant cues on reversal learning and on 
nonreversal shifts produce results either in- 
explicable in terms of this account or actually 
contradicting it. Nor does it seem easy to 
account for the basic difference between the 
results of visual and spatial reversal experi- 
ments in this way (but see Sperling, 1965). 
These arguments, however, are not to be taken 
as denying (what must be true) that, other 
things being equal, rate of extinction and rate 
of reversal will be positively correlated. What 
is being denied is that in simultaneous visual 
discriminations other things are equal. Cer- 
tainly, as Theios and Blosser (1965) have 
claimed, it is possible that reward magnitude 
is an important variable in determining 
whether overtraining facilitates reversal, be- 
cause reward magnitude and overtraining in- 
teract in their effect on extinction. But this 
would not constitute an explanation of the 
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rats by Mackintosh, 1963b; for chicks by 


relevant cues, while th 
claims that it is because they have h 
ther opportunity to learn to attend to the 
relevant cue. Decision between the two ac- 
counts can be reached by delaying the in- 
troduction of the extra irrelevant cue until 


€ attention theory 


ad fur- 


reversal. Overtrained animals, although they . 


have had no Opportunity to adapt out the 
cue, are still able to ignore it during reversal 
(Mackintosh, 1963b), and this can only be 
because they have learned to attend to the 
relevant cue. 
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More decisive evidence against Spent 
explanation of reversal learning comes 0 
experiments on nonreversal shifts to be © 
sidered below. Further evidence in fay 
attention theory comes from experi 
Which have shown that the relative i 
of extinction of attention and choice eS 
do indeed vary as a function of or ) 
in the way predicted by the theory: f th 
capitulate: The attention explanation p 
overtraining reversal effect is based upy th 
assumption that overtraining increase 
probability of attending to the relevat ond 
during reversal without causing a corre P 
ing increase in choice response streng pec" 
result of this is that overtrained SU sil 
extinguish their choice responses WP no! 
attending to the relevant cue, whereas ont 
Overtrained subjects extinguish their te” full! 
to attend to the relevant cue befor l 
extinguishing choice responses. A ae 0 
of this hypothesis was to train subje ove 
a given discrimination with or without wl 
training and then to extinguish ` em te 
they selected S+ and S— equally oi! 
When reversed, overtrained subjects, Be 
extinguished choice responses only, ha! ons 
to establish newly appropriate ee 
while nonovertrained subjects, who had R 
attending to the relevant cue without ey j 
extinguished choice responses, when g i 
tended to the relevant cue again, $ yp 
responding in the direction in which oh 
Were originally trained, Tn two exper ii 
With rats (Mackintosh, 1963c, 19650) al 
difference was demonstrated: Upon "° 


former S+, w 


Variability of the Bfect. A numbet oi 
vestigators haye claimed that the overt it 
s oversal effect: isan elusiva even suspire | 
inconsistent, phenomenon. Doubtless Pd ; 
much Variability in the results of Ps 0 
logical experiments owing to varia, i 
experimental techniques, but there "af 
evidence that the overtraining revers? p, 
'S unusually prone to inexplicable aieh sy | 
ances and sudden reappearances. AD gf 
gestion that this is So seems to 
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due to confusion of two separate classes of 
experiment—reversal of visual discrimina- 
tions and reversal of spatial discriminations. 
So far we have been talking about reversal 
of simultaneous visual (usually brightness) 
discriminations. To date there have been 13 
Such experiments on rats, the results of which 
ate presented in Table 1. 
ae also been possible to obtain the 
thin i octopuses (Mackintosh & Mack- 
a da 963), chicks (Mackintosh, 1965b), 
provid en (Youniss & Furth, 1964a,b), 
i ch the discrimination was relatively 
spit and contained irrelevant cues. In 
€ of Erlebacher’s (1963) single failure, 
and Behe the repeated failure of D’Amato 
ile iff (1965) to obtain the effect, the 
ticulant of these experiments are not par- 
aah Contradictory. It can certainly not 
` k i that the effect is an elusive one. 
rey e other hand, the results of position 


altho experiments (shown in Table 2), 
are 


$ü somewhat more conflictin 
he g 8 
j Tht decisive in failing to find the effect. 
only eee a total of 22 experiments, in 
reversa] en of which did overtraining facilitate 
“al. The result in the “no effect” col- 
reves usually that overtraining retarded 
‘The » but not significantly. 

tlie ee elem, then, is how to explain both 
the g ral difference between the visual and 
the tial discrimination experiments, and 
Stug, wicting results within each class of 
Visual pn’ fairly obvious suggestion is that 
Patia) roblems contain irrelevant cues, and 
Of ure lems in general do not. It has 
© already been shown that irrelevant 
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TABLE 1 


RESULTS OF VISUAL DISCRIMINATION REVERSAL 
EXPERIMENTS WITH RATS 


Overtraining facilitates 
reversal 


Overtraining has no 
effect on reversal 


Brookshire, Warren, and Ball 
(1961) 

Capaldi and Stevenson (1957) 

D'Amato and Jagoda (1961) 

Komaki (1961) 

Mackintosh (1962, 1963b, 1965b) 

North and Clayton (1959) 

Pubols (1956) 

Reid (1953) 

Sasaki (1960) 


Erlebacher (1963) 
D’Amato and Schiff 
(1965) 


? 
cues are important in producing the effect. 
In support of this suggestion is the fact that 
two of the spatial experiments in which 
overtraining facilitated reversal (Brookshire, 
Warren, & Ball, 1961, and Pubols, 1956) 
id actually involve irrelevant visual cues, 
while a third (Bruner et al., 1958) was a 
relatively complex alternation learning prob- 
lem in a 4-unit maze, instead of the normal 
simple T-maze problem. Similarly, Youniss 
and Furth (1946b) found that overtraining 
only facilitated reversal of a spatial discrimi- 
nation in children when there were irrelevant 
cues. Furthermore, even in a brightness dis- 
crimination the experimental situation may be 
such as to minimize the importance of ir- 
relevant cues, and thereby reduce the chances 
of finding an effect. It is possible that the 
automatic apparatus used by D’Amato and 
Schiff (1965) in their brightness reversal 


TABLE 2 
RESULTS OF SPATIAL DISCRIMINATION REVERSAL EXPERIMENTS WITH RATS 


Ove, 
Train} ays 
training facilitates reversal 


Overtraining has no effect 
on reversal 


Overtraining retards reversal 


TOOks a 
Brun ae Warren, and Ball (1961) 
c (1958) ndler, O'Dowd, and Wallach 


Dal; 
Ron att (1963) 


p, > 

nba Gees (1961) 
& 

Theigg T (1961) 


nd Blosser (1965) 


Clayton (1963a, 1963b) 
D’Amato and Jagoda (1962) 
D’Amato and Schiff (1964) 
Galanter and Bush (1959) 
Hill et al. (1962) 

Iwahara (1959) 

Komaki (1962) 

Mackintosh (1965c) 


Hill and Spear (1963b) 

Hill, Spear, and Clayton (1962) 
Krechevsky and Honzik (1932) 
Mackintosh (1965c) 


Paul and Kesner (1963) 
Theios and Blosser (1965) 
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experiments provided very few irrelevant 
spatial cues—certainly rats seemed to take 
a long time to learn a simple position habit 
in this apparatus (D’Amato & Jagoda, 1962). 
On the other hand, as pointed out by Lovejoy 
(1965), Pubols’ results are not reliable, since 
his two groups were not properly matched in 
initial learning speed, and Clayton (1963b) 
performed a direct test of the effect of irrele- 
vant cues on spatial reversal with negative 
results (in fact, he claimed that irrelevant 
cues actually retard overtrained subjects more 
than nonovertrained subjects, but his claim 
is unconvincing since he failed to find any 
difference in overall trials to criterion), ` 

The question of the importance of irrele- 
vant cues in spatial reversal is one that 
deserves further investigation, Nevertheless, 
it is extremely unlikely that this is the only 
critical difference between visual and Spatial 


problems. Mackintosh (1965c) ha 
that one 


(see Mackintosh, 1965c). 


A third possible explanation in terms of an 
attention theory has 


joy (1965). He points out 


med to depend 
‘asing to attend 


nonovertrained subjects will 
so throughout reversal. Rats learn spatial 
discriminations very fast and are generally 
reckoned to be dominantly spatially oriented: 
therefore, spatial cues have a high priority 
for rats, and are automatically attended to. 
Under these circumstances, a two-stage theory 
contracts into a single-stage theory and will 
not predict an overtraining reversal effect. 


continue to do 
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Evidence bearing on this general point 
view has been provided by Mackintos 
(1965b). Chicks are dominantly visual ati 
mals, learn simple visual discriminations ve 
fast, and do not show the overtraining it 
versal effect unless provided with an extreme 
difficult discrimination, jelly 
Two other possibilities should be La 
mentioned. Theios and Blosser (1965) a 
argued that the overtraining reversal © ed. 
only appears when a large reward is nat 
The basis of this suggestion is the assumi 
that reward magnitude and overtraining A an 
act to determine resistance to extinction, h 
assumption that receives support from no 
study of Ison and Cook (1964), pei at 
from that of Madison (1964). This ceti 
best be only a partial explanation, ffect 
variations in the overtraining reversal a 
can occur in the absence of changes in 365b); 
magnitude (e.g., Mackintosh, 1963b, 19 dill 
it may nevertheless have some ve nat? 
Finally it is worth noting that D heir 
and Schiff (1965) in all but one of rials 
experiments gave 240 overtraining © iy. 
There is in fact no evidence from any fet 
lished study that overtraining to this - 
will ever facilitate reversal learning. ge 
Tt can be seen that there is no sh? 
of possible explanations for the differ® 
between various experimental results: on 
that several of these explanations stem "od 


. sit 
an attention theory. It should be empha" OF 
that these ex 


clusive. Ther, 
one is right 


nce 
nd 


Overdetermined, rl 


ye 
research on Tepe 
t must be made. y ? 


reason that res of 
rated almost exclusive” pes 
overall trials or errors to criterion p w 

here have been relatively few attemP” ip? | 
examine in detail Possible differences ™ er | 
course of reversal learning betwee? yet | 
trained and nonovertrained subject® gp 
these are the critical differences whic ffe 
explanation of the overtraining reversal ed” 
must handle, anq which must in fact 
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between alternative explanations. For ex- 
ample, the occasional failure to find an over- 
training reversal effect in brightness discrimi- 
nation problems might be either because the 
"relevant spatial cue was not a dominant one, 
oo some factor was increasing re- 
Ce to extinction in overtrained subjects. 
— oo of the daily reversal 
ut R decide between these alternatives, 
Tn the { see are virtually never provided. 
vide the « ure, experiments which fail to pro- 
te oe necessary for such decisions 
aes fe e hope of advancing our knowledge. 
studies no need for a proliferation of further 
Siving bare trials to criterion data. 
Medi and Nonreversal Shifts. H. H. 
aila T, S. Kendler (1962) and their 
n es have laid almost exclusive emphasis 
tates soe Matanog of a comparison between 
Shifts as earning reversal and nonreversal 
theory i riene for or against a two-stage 
`- Aave already argued that this em- 
misplaced, since the predictions a 
reversa] Je theory makes about speed of 
tion o ng depend on further specifi- 
Utention ne relative rates of extinction of 
it see choice responses. Nevertheless, 
Possible re as they recognize, that it is 
© explain the superiority of reversal 
theo, versal shifts in terms of a two-stage 
has Sen ens (once partial reinforcement 
to do so „equated) it does not seem possible 
Evidence, terms of a single-stage theory. 
than ing ce that reversals are solved faster 
Ments e ee shifts comes from experi- 
arrow college students (Buss, 1956: 
D mato, 1 Friedman, 1958; Kendler & 
» 1955), and children over the age 
r endler & Kendler, 1959). Younger 
a eta Kendler, & Wells, 1960) 
tha, | than Eata 1956) learn nonreversals 
leet Young chi sals. From this it is claimed 
ĉarnerg » S, Children and rats are “one-stage 
e r rh it is not a necessary conse- 
a ee theory that reversals 
thet §8 nothing © ates fhan nonreyersals, 
Co Posite. hg Pe Darrassing about finding 
of Snows that act, more careful considera- 
t eteni > In view of the explanation 
his je nmg reversal effect just pro- 
ted. TF anime ely what would be ex- 
will only dals are trained to criterion, 
y learn a reversal faster than 
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a nonreversal if criterion training is sufficient 
to guarantee that attention extinguishes more 
slowly than choice responses. But this is 
exactly what has been assumed not to hap- 
pen; in rats this is assumed only to occur 
after overtraining. 

In rats, therefore, criterion training should 
produce faster learning of nonreversals than 
of reversals, while overtraining should at the 
least reduce this difference. This is true 
for young children—compare the results of 
Kendler, Kendler, and Wells (1960) with 
those of House and Zeaman (1962)—and 
although no one experiment has provided a 
direct test of the proposition in rats, the 
evidence is strongly in its favor. Kelleher 
(1956) has confirmed the first part, while 
Mackintosh (1962) has shown that although 
overtraining benefits reversal learning it re- 
tards nonreversal shift learning. 

This last result has been confirmed on sev- 
eral occasions (in rats by Mackintosh, 1963c, 
1964; in octopuses by Mackintosh & Mack- 
intosh, 1964). However, Brookshire et al. 
(1961) and Uhl (1964) failed to find any 
effect, and Komaki (1961) has claimed that 
overtraining benefits nonreversal learning. A 
possible source of confusion here is that both 
Brookshire et al. (see Mackintosh, 1964) 
and, in one of his experiments, Komaki (see 
Law, 1954; Mackintosh, 1963a) may have 
used a nonreversal discrimination that in- 
volved the same relevant cue as did original 
learning. If overtraining increases the prob- 
ability of a subject attending to a particular 
stimulus dimension, then this would account 
for their results. Uhl’s results are not strictly 
comparable with other experiments, since he 
trained his subjects on a partial reinforcement 
schedule. Sutherland’s (1964) explanation of 
the effects of partial reinforcement in terms 
of an attention theory would, in fact, prob- 
ably predict this result of Uhl. 

If it can be accepted that, at the least, 
overtraining does not facilitate nonreversal 
learning, then this decisively rules out any 
explanation of the overtraining reversal effect 
that fails to take into account the fact that a 
reversal involves the same relevant cue as 
original learning. It is obvious, for example, 
that a simultaneous brightness discrimination 
and a simultaneous orientation discrimination 
share irrelevant (position) cues in common, 
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and if overtraining is supposed directly to 
reduce the probability of responding to ir- 
relevant cues, it must benefit nonreversal 
learning at least as much as, and possibly 
more than, it benefits reversal. 


‘THEORETICAL INTERPRETATIONS 


This survey of the relevant experimental 
evidence has established two main claims: In 
solving discrimination problems, animals (a) 
do not attend equally to all features of the 
stimulus situation, and (b) must learn which 
features to attend to. One of the major theses 
of this review is that there is a logical con- 
nection between these two claims, since if an 
animal automatically classified its input in 
all ways simultaneously, there would be no 
need for it to learn to perform the appropriate 
classification. Such a need only arises if ani- 
mals do not automatically attend to the 
relevant stimulus dimension, 

Surprisingly, the two Positions seem to 
have grown up more or less independently. 
Noncontinuity theorists like Lashley and 
Krechevsky introduced the concept of atten- 
tion into discussions of discrimination learn- 
ing by claiming that animals attended to only 
a single cue at a time, but they made little 
systematic attempt to consider what implica- 
tions this position would hold for the analysis 
of the learning Process. Conversely, there has 
recently been a great growth of interest 
among relatively orthodox S-R theorists in 
various two-stage models of discrimination 
learning, but very often the connection with 
the original question of attention has been 
totally ignored. Among other possibilities, it 
has been suggested that the subject must ac- 
quire relevant mediating responses (Kendler 
& Kendler, 1962), or observing responses 
(Miller & Dollard, 1941; Wyckoff, 1952), 
or orienting responses (Spence, 1960). Rather 
closer to the spirit of this review, others have 
suggested that the subject must attend to 
the relevant dimension (Goodwin & Lawrence, 
1955), learn to switch in the relevant analyzer 
(Sutherland, 1959), or learn attention re- 
sponses (Lovejoy, 1965; Zeaman & House, 
1963), while Lawrence (1949, 1963, Tespec- 
tively) has also talked of the acquired dis- 
tinctiveness of cues, and of stimulus coding. 

Although there are clear logical similarities 
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ions 
between all these theoretical suggeri 
there are also differences of ao dit- 
them, and in the following account t a 4, 
ferences may well appear to be paged: 
but such exaggeration is not merely pe be no 
In the light of the evidence there can com 
doubt that some sort of two-stage be, 
of animal discrimination learning a well 
provided, and up to this point there oo is 
be agreement. But if theoretical ais Jines 
be made, it must be partly along E nature 
of specifying more precisely the exac i 
of the two stages. This section repre mode! 
preliminary attempt to classify the sides 
Proposed so far, and very briefly Jevnt to 
some of the evidence that will be rele 
deciding between them. 

For expository purposes, the models i 
divided into three classes: (a) one coll): 
observing response models (Spence, W y ler 
(b) mediate responses models (Ken (Law 
Kendler), and (c) attention models 
rence, Sutherland, Zeaman & House): 


n be 


Orienting Response Theories pe 

To S-R theorists there seems to ia H 
something doubly distasteful about m 
cept of attention. In a very general versi” 
way, its use would appear to imply a re chol 
to a mentalistic, even introspective Pi to 
08y; more specifically, it would aPP peot! 
imply a central mechanism, and S-R al 10° 
is happiest when dealing with exter? ie 
Sponses. The concept of the orienti” g a 
Sponse, in fact, has been put forward J 
external substitute for attention. pis 


A f 
Already in the earliest formulation ane 
theory of discrimination learning, gte 
(1936) noted that “th 


e mere presence ° ptal 
re in the experi™” ppt 
rantee its imping? tbe 
orium at or neat ge 


cue stimulus somewhe 
situation does not gua 
on the animals sens 
critical moment 


1 
later elaborated this by saying that solt 


Baas = Fy 5 
of a discrimination problem involve 
learning of ap 
. b the a 
responses which lead to the reception ony a 
propriate aspects of the total environmen gent lal 
plex on the animal’s sensorium, e.g., the Of" jti 


crit ge 
and fixation of head and eyes towards the "; of 
stimuli. That is, 


k a 1 
the animal learns to Mook | 1% 
aspect of the situation rather than anothe 
P. 432], 
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Thus, when confronted with evidence that 
animals had failed to attend to the relevant 
gue in Krechevsky’s (1938) presolution 
Period experiment, Spence (1940) claimed 
Pai ths animals “did not see the rows of 
Sthet e oo they were fixating 
281]. pects of the stimulus complex [p. 
an Lashley (1942) insisted, this particular 
eer somewhat implausible from purely 
rit considerations of the angular sepa- 
ae of different parts of the stimulus cards 
Vision Ping stand, and of the rat's field of 
Tore + and as Goodrich et al. (1961) have 
when ne pointed out, other criteria of 
urn ats are looking at a particular stimulus 
these i dangerously circular. In spite of 
total | 1ficulties, and in spite of the almost 
ark of evidence showing the importance 
ion $ orienting responses in discrimina- 
oa Ag by rats—Ehrenfreund’s results 
M were seemingly contradicted by 
ona tad rd type of account has 
Tecent stat easingly popular—Spence's most 
able sty ement of his theory laid consider- 
1960) ess on orienting responses (Spence, 
(1953) and both Pubols (1956) and Reid 
Teversa point tag to explain the overtraining 
is effect in these terms. ; 

Renious Popularity is partly due to the in- 
here oe pioneered by Wyckoff (1952), 
Specified als literally had to learn to make 
Cri inatiy Tesponse in order to see the dis- 
*Perime, e stimuli, The ingenuity of such 
Shoulg me and the interest of their results 
do ind us neither to the fact that they 
*imaPs manipulate the orientation of an 
that ; ` T€ceptors at all, nor to the possibility 


A n ; os P P 
Derim, Sorma] discrimination learning €x- 
peessary no such receptor orientation is 
e gp? Since in any suitable apparatus 


i e k 
Stimuli” ct automatically sees the relevant 


Port ; 
Man Gee it is possible to go further 
in tention on to show experimentally that 
L terms o nnot be adequately conceptualized 
o Mrence Pee: responses. Goodwin and 
setting it 955) showed selective attention 
oy wli Weie situations where the relevant 
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enting responses. Their argument is plausible, 
but not compelling. But if rats are trained 
on a discrimination between a black hori- 
zontal and a white vertical rectangle, there 
is no conceivable way in which they could 
orientate themselves so as to look at the 
orientation differences between the stimuli 
without thereby seeing the brightness dif- 
ference (and vice versa). Thus, demonstra- 
tion of selective attention between these 
dimensions in this situation decisively rules 
out any orienting response interpretation 
(Mackintosh, 1965a). 

The writer has also specifically tested the 
explanation of the effect of overtraining in 
terms of orienting responses proposed by 
Pubols (1956) and Reid (1953). If over- 
trained animals are more likely to look at 
the appropriate physical stimuli, then over- 
training on a square versus diamond discrimi- 
nation, in which both stimuli are sometimes 
black and sometimes white, could only benefit 
the learning of a black- versus white-square 
discrimination. In fact, as is predicted by 
attention theory, overtraining retards learning 
in this situation (Mackintosh, 1964). Thus 
overtraining benefits reversal learning, not 
because the physical stimuli of the reversal 
problem were present during original train- 
ing, but because both problems involve the 
same stimulus dimension. It is just not 
reasonable to suggest that any orientation 
of an animal’s receptors could be responsible 
for selection between dimensions of stimuli. 


Mediating Response Theories 


The notion of the orienting response shares 
one feature in common with the concept of 
attention: At least it is clear that if animals 
fail to establish appropriate orientation of 
their head and receptors they will fail to 
solve the discrimination problem. However, it 
is also clear that there is a sense in which 
animals attend to some cues and ignore 
others, which cannot be characterized by 
talking of overt orienting responses. Perhaps 
some such considerations have led other S-R 
theorists into adopting a mediating response 
theory, but there has been a tendency to 
leave the precise nature of these mediating 
responses unspecified, and consequently no 
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such arguments as those above can be 
marshalled against the theory. 

This vagueness, however, 
further difficulties, of which the foremost 
is that the function of these mediating re- 
sponses remains very unclear. Kendler and 
Kendler (1962), for example, do not regard 
the occurrence of mediating responses as 
necessary for discrimination learning, since 
children under the age of five and all non- 
human animals are assumed not to have such 
responses available. There is no evidence 
that older children and adults are notably 
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response extinguishes faster, then to learn a 
honreversal the subject simply extinguishes 
the old mediating response, establishes a new 
one, and attaches new overt responses (mak- 
Ing a total of three connections); while to 
learn a reversal, the subject must extinguish 
the old mediating response, extinguish its old 
pe responses, reestablish the old mediating 
Ponse, and establish new overt responses 
making a total of four connections). 
Pa likely, then, that in order to explain 
ee “oy of this type of theory the greater 
Teversal to nonreversal shift learning, 
eee response must be assumed to 
oes more slowly than the overt re- 
structed ut if a two-stage model is con- 
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Moa ions about the laws governing the two 
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this it oo strength; as a consequence of 
ny Hoe the mediating response that reaches 
en stage of extinction first. 
ulus’ therefore, the claim that “implicit stim- 
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versal learning should be explicable in these 
simple terms, and this exciting possibility of 
actually explaining species differences in 
learning experiments seems to be denied by 
the assumptions of S-R theories in this area. 


Attention Theories 


The criticisms made above of orienting and 
mediating response theories can be summar- 
ized by listing three requirements of a more 
adequate two-stage theory. 

1. The attention stage should serve the 
function of permitting the animal to discrim- 
inate a given stimulus dimension. Evidence for 
this comes from all experiments demonstrat- 
ing that animals do not learn about all cues 
equally. 

2. This effect cannot be exclusively due to 
overt orientation of the animal’s receptors. 
Attention selects between stimulus dimensions 
both in humans (Harrow, 1964; Isaacs and 
Duncan, 1962), and in rats (Shepp and 
Eimas, 1964), and not between stimulus ob- 
jects (Mackintosh 1964) as is implied by 
orienting response theory. 

3. There is no evidence to support the as- 
sumption that attention and choice responses 
obey exactly the same laws, and the possibil- 
ity of altering the parameters of one inde- 
pendently of the other generates explanation 
of a wide range of comparative data. 

Although, therefore, the similarities between 
different two-stage theories are considerable, 
there remain important differences between 
them. It is possible to test some of these 
differences experimentally, and the results of 
such experiments appear to lend support to 
the versions proposed by Goodwin and Law- 
rence (1955), Sutherland (1959), and Zeaman 
and House (1963). 


CONCLUSIONS 


The available evidence both supports a 
two-stage model of discrimination learning as 
opposed to a single-stage model, and within 
the class of two-stage models supports some 
of the implications of an attention theory as 
opposed to other theories. At the least, then, 
it would seem worthwhile to explore the po- 
tentialities of attention theories of learning 
further. Two possibilities suggest themselves. 
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On the one hand, if the concept of attention 
is one that can be profitably applied in the 
area of discrimination learning, it is worth 
considering its application to other problems. 
It is tempting, for example, to suggest that 
the absence of latent learning under some 
conditions, such as high irrelevant drive 
(Johnson, 1952) or prior overtraining (Brad- 
ley & George, 1962), can be explained in 
terms of a failure of attention to the relevant 
goal object; or that some of the differences 
between ordinary conditioning and sensory 
preconditioning (Hoffeld, Kendall, Thomp- 
son, & Brogden, 1960) reflect not a basic 


: preliminar 
evidence (Sutherland, Mackintosh, & Wolfe, 
1965) supports this interpretation, Í 


respectability (mod- 
Broadbent, 1957, and 


to learn than a nonreversal shift. But if the 
theory were sufficiently precise there would 
be no room for verbal argument; either the 


and “stat-subjects” 
appropriate experimental situation, 
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procedure would give an unambiguous answe! 
to the question. 

Fortunately some preliminary steps have 
already been taken. The mathematical theory 
of Zeaman and House (1963) has been suc 
cessfully applied to the behavior of retard 
children in a number of situations, And Toga 
joy (1965) has proposed a mathemati 
model which he has shown will predict 
overtraining reversal effect. Exploration i 
the potentialities of these theories WOU 
Seem to promise considerable rewards. 
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A NOTE ON REPEATED MEASURES IN THE STUDY 
OF SHORT-TERM MEMORY 


LLOYD R. PETERSON ! 


Indiana University 


Keppel’s (1965) objection to averaging over repeated measures is discussed. 
Several possibilities for minimizing proactive interference are suggested, includ- 


ing increasing time between tests. 


Consideration is given to acceptance of a 


constant amount of proactive interference over conditions in which other 
variables are studied. The advantages of using repeated measures in regard 
to efficiency as well as interpretation are noted. 


aig Pel (1965) has written a stimulating 
7 ie ion of some methodological problems 
Study of short-term verbal retention. 
© Problem concerns the use of repeated 
i and his conclusions in this regard 
aver ed taken as a virtual proscription on 
Subject, H & Over repeated tests on the same 
of STM He says, “. any complete study 
unt ia involve a manipulation of the 
t of prior-learned material [p. 6].” In 
“cussion that follows it becomes clear 
= Slat a testing large numbers of 
to allow independent-group com- 
on the first retention test and a 
eg introduction of subsequent tests 

he. he same and different conditions.” 
Sear, ch in implications of this position for re- 
€ area are serious. The value of 
iit a. ube of studies already reported 


oe 


Since t the testionable, and the future is bleak, 

large numbers of subjects required 
T “in of the type that Keppel en- 
Partie” Hake research slow and expensive. 
aig . he suggested solution will render 
Teten any conclusions about the nature of 
for era functions for individual subjects, 
of aver aging between subjects takes the place 
the ; "aging within subjects. An instance of 
short e taice of individual differences in 
Pola memory is seen in a report by 
Measur 1963). Tn an analysis of repeated 
thita g ements Pollack compared the highest 
Orman is Subjects (in terms of total per- 
‘though with the middle and lowest third. 
Pacitigg | the differences in the subjects’ ca- 
t Thi j immediately recall six digits were 
“rin me je was written at the Institute of Human 


niversity of California, Berkeley. 


small, the differences when one extra digit 
intervened between presentation and recall 
were very large. The high scoring group 
dropped from 9896 correct without the extra 
digit to 95% with the extra digit. On the 
other hand, the low scoring subjects dropped 
from 83% to 35% correct with the addition 
of the digit. The curves obtained in a para- 
metric study of number of interfering digits 
and duration of additional delay were plotted 
separately for high, middle, and low subjects, 
and the differences in shapes of the curves 
were marked. Such differences would have 
been overlooked without averages over re- 
peated measurements of the same subjects. 

Tt should be noted that Keppel’s (1965) 
concern over repeated measures stems from 
studies (Keppel & Underwood, 1962) in 
which a progressive decrement has been 
observed over the first few tests with a 
technique in which recall of a trigram is 
measured after an interval filled with ir- 
relevant verbal behavior. This is an impor- 
tant finding which must be considered in 
accounting for short-term forgetting. How- 
ever, when this decrement is not the object 
of study for its own sake, there are ways of 
minimizing it to permit study of other vari- 
ables. To the extent that this decrement is 
due to proactive interference, the effect should 
be maximized for retention intervals of long 
duration, since proactive interference in- 
creases with the retention interval (Keppel & 
Underwood, 1962). Conversely, proactive in- 
terference is minimized in that class of studies 
in which recall is tested shortly after presenta- 
tion, as in studies of immediate memory span, 
In the testing of longer retention intervals, 
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proactive interference can be reduced by 
lengthening the interval between tests. In an 
extreme case, Loess (1964) found that after 
a period of three weeks no effect of a previous 
session remained. In dealing with more work- 
able intervals between tests, Peterson and 
Gentile (1965) have found a significant reduc- 
tion in the effect of previous tests when the 
interval from the signal for recall in one test 
to the beginning of the next test was length- 
ened from 5 to 16 seconds, and a further re- 
duction from 16 to 91 seconds. In the Keppel 
and Underwood studies (1962) the intertest 
interval was not specified, but Keppel ? indi- 
cated that it was 10 seconds. This is shorter 
than the 15 seconds of the initial studies using 
this technique (Peterson & Peterson, 1959), 
The marked Progressive decrement as a 
function of previous tests, found by Keppel 
and Underwood in their third experiment 
(1962), may well have been influenced by 
this short intertest interval as well as by the 
bias that Keppel mentioned. In other experi- 


ments, as Keppel noted, a Steady state has 
quickly been reached. 


Since a stead 
initial 


The experimenter wi 
results may not gene 
tion to the case whe 
is of less effect, Thi 
ment yielding a curve which 
should be labeled the “RI [ 
tion] work curve.” In so lab 
a study by Peterson, Saltz 
Land (1962), Keppel impli 
the legitimacy of the prin 
out a constant 


experi- 
Keppel says 
retroactive inhibi- 
eling a curve from 
man, Hillner, and 
citly acknowledges 
ciple of balancing 
amount of an unwanted vari- 


* Personal communication, 1965, 
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able. His RI work curve was obtained e 
conditions where varying numbers of Jatim 
ous pairs were given before the presen’ 
of a tested pair. The first pair in a avoi 
of presentations was a dummy used t0 d be 
the marked decrement commonly foun ence: 
tween the first and second pairs in a sequ airs 
Other differences in numbers of pee . 
were balanced out by the Lesgaunet wor 
sign. The curve could be called an R mou 
curve under conditions of a constant A iy, 
of proactive interference. The sgan 
volves a kind of mixing of proactiv a 
retroactive interference similar to that © vel 
ing within a list of pairs being learne pion 
trials. The chief difference is that on ave’ 
intervals for individual pairs were si of 
aged as in list learning, but each leng 
interval was treated separately. 
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EQUIDISTANCE TENDENCY AND ITS CONSEQUENCES 
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The equidistance tendency is the tendency for objects or other inhomogeneities 
in the ficld of view to appear at the some distance as each other with the 
strength of this tendency being inversely related to directional separation. The 
evidence for the existence of the equidistance tendency and for its ability to 


modify the perceived depth resulting f 


rom size or stereoscopic cues is reviewed. 


The equidistance tendency is discussed as a disturbing factor in visual experi- 
mentation and as a necessary factor in the understanding of Emmert’s law, the 
moon illusion, and similar phenomena. Several possible explanations for the 


equidistance tendency are evaluated br: 
with which it is identified. 


The 


Usual perception of volume seems to be the 


sual ep resulting from homogeneous 
low. mulation (Natsoulas, 1963). It fol- 
p homogeneous visual fields the 
Si specific cues is not a necessary 
ei EA the perception of depth. How- 
apply a me conclusion does not necessarily 
Mogeneiti, Perception of depth between in- 
ur withont in the visual field when these 
ecifica] ut the introduction of depth cues. 
D the question might be asked as 
fo e a appear in depth with re- 
icate T other when all cues which would 
ine ‘i they are at different distances 

at ed. It might seem reasonable to ex- 
tied de. the absence of depth cues, the 
eae between two objects would be 
y titudina] bi or controlled by momentary 
OW, dent, wes: According to this point of 
visų a Perception in the absence of 
Dro €S would be erratic because there is 


Presence 
conditi 
[; 


No 


+ Process wh; 
pa bein S which would result in one percep- 
Howey, S More likely to occur than another. 


a t is in spite of its apparent inevitabil- 
ne el of view is not the only one 
cee a “ae it might be supposed that 
ea ay of visual cues the perceived 

Me eat n two objects would tend toward 
Pino, © perception which would be de- 

Arbara, at the University of California, Santa 


iefly in terms of the range of phenomena 


termined by organizing factors within the ob- 
server. It is the purpose of this paper to sup- 
port this latter point of view by considering 
the evidence for a tendency to perceive ob- 
jects as equidistant when effective cues as to 
their perceived depth are absent. Further- 
more, it has been demonstrated that this 
tendency to perceptual equidistance can oper- 
ate with some degree of effectiveness in situa- 
tions in which not all depth cues are elim- 
inated. As will be discussed, as a result of this 
characteristic, the equidistance tendency exer- 
cises an often unrecognized influence upon 
perceptions occurring under a variety of 
visual conditions. 


EVIDENCE FOR THE EQUIDISTANCE TENDENCY 


The equidistance tendency is the tendency 
for objects or parts of objects, in the absence 
of effective distance cues, to appear visually 
at the same distance as each other with the 
strength of this tendency being inversely re- 
lated to the directional separation of the ob- 

‘jects or parts (cf. Gogel, 1963). When two 
objects (a luminous square and a rectangle) 
physically located at different distances from 
the observer were viewed in the absence of 
distance cues, it was found that the two 
objects appeared to be at approximately the 
same distance (Gogel, 1956b, Experiment I). 
It was therefore hypothesized that in the 


153 


154 


absence of contrary visual information there 
was a tendency for objects to appear equi- 
distant. The word “tendency” was used since, 
in general, it was found that considerable indi- 
vidual variation could occur in this response. 
If it is assumed that the equidistance 
tendency is present between all objects in the 
field of view, it follows that, as the number of 
objects increases, the number of tendencies 
toward equidistance increases with some of 
these possibly being in conflict. For example, 
if two objects appear at different distances, a 
third object with no depth cues with respect 
to either of these objects cannot appear simul- 
taneously to be at the distance of both ob- 
jects. It was found under these conditions 
(Gogel, 1956b, Experiment IT) 
third object appeared nearest in depth to that 
object to which it was directionally most ad- 
jacent. It was as though each of the two ob- 
jects contributed an equidistance effect with 
Tespect to the third object, but the object 
directionally most adjacent to the third ob- 


(Gogel, 1956 
tigated the perce 


Again, in each of 
single cylinder was 


oc- 
ela- 
the 
w. 
the 


tive size cue between cylinders to localize 
single cylinder midway in depth in the ro 
In the experiments discussed above 
3 


perimental manipulation, However, 
intended that these results should a 
to situations involving a combinati 


it is not 
pply only 
on of mo- 
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nocularly and binocularly viewed objects (cf 
Gogel, 1956a, 1956b, 1963). + ated 
Roelofs and Zeeman (1957) investiga” 
by means of stereograms, the perceived sit 
and distance of monocularly viewed obia 
presented simultaneously with binoculart 
viewed objects, Only the results concer ai 
perceived distance are pertinent for the E 
ent study. In one series of stereograms, rl 
monocularly observed and two binocular 
observed circles were viewed with one of i- 
binocularly observed circles being stereos- e 
cally more distant than the other. One 0 he 
monocularly seen circles was closer 0? cu- 
stereogram to the more distant of the bino he 
larly seen circles and the other closer sel 
less distant of the binocularly seen C" een 
The results were that the monocularly Yä- 
circle which was closer on the stereogra™ to 
rectionally closer in the visual fie seen 
the stereoscopically farther, binocularly nore 
circle usually was perceived to be the P, 
distant of the two monocularly seen C!" bi- 
It was concluded that the objects e of 
nocularly influenced the apparent distan? the 
the objects seen monocularly, wit? with 
strength of this influence being greate" jgual 
less separation between them in the V ent 
field. This conclusion is clearly in agree ant? 
with the demonstrations of the equidis! 
tendency cited above, ra 
Ina supplementary series of stereos ly 
Roelofs and Zeeman presented a binocl ghe 
viewed and a monocularly viewed circle. oy. 
monocularly viewed circle tended to Pe arly 


ceived as slightly nearer than the binock ute 
viewed circle, This d 
and Susceptible to 
Zeeman, 1957, p. 1 
later study 
encies were ex 


. m. 
stams and in the later study, There Se? 1? 


be a tendency, other things being ed go 
See objects in th i 


half of the visu. 
objects in the 
(1961), while a 
equidistance 

these and oth 
the positional depth tendencies which. rane 
cannot be subsumed under the equi ye 
tendency, Perhaps it is reasonable to © 


i 
99) also appeared ten” 


EQUIDISTANCE TENDENCY 


that the equidistance tendency is not the only 
such tendency in the visual field. It is the 
Point of view of this paper, however, that 
the equidistance tendency, since it applies 
throughout the visual field, represents a gen- 
eral perceptual factor upon which other ten- 
encies must be superimposed. 

The equidistance tendency has been dem- 
onstrated to apply when all the objects in- 
Volved are viewed monocularly. For this 
demonstration, two playing cards were used 
Which appeared at different distances as a 
Consequence of a difference in retinal size 

sel, 1956a, pp. 4-6). A small disc was 
Placed directionally closer to either the ap- 
rently nearer or the apparently farther card 
ig between the cards in direction. The 
Darel results indicated that the disc ap- 
tex ae closer in depth to the card to which it 
Drozina nally closer, and it appeared ap- 

i midway between the cards in 
noe it was midway between them in 

ee In a further experiment (Gogel, 
Ment pP: 7-14), the apparent path of move- 
to a a monocularly observed disc attached 
zoida Monocularly observed rotating trape- 
ralia ot was analyzed, using the gen- 
“Wuidistan that the resultant strength of the 
Vations ance tendency with respect to the 
function ars of the window would vary as a 
tion at of the changing directional orienta- 
This pois 2€ disc with respect to the window. 
that patter experiment not only demonstrates 
objec «ocular Observation of some of the 
for a = not a necessary condition in order 
t also “quidistance tendency to be effective, 
tance , "PPorts the concept that the equidis- 
Well ae y applies to parts of objects as 
anq all © objects per se. All objects, contours, 
ee of objects in the visual field 
dista ni > ad are subject to, a visual equi- 
i Gone ect with respect to all other ob- 

Ours, and parts of objects, with the 
any ‘ha S the equidistance tendency between 

7 Site inhomogeneities being in- 
io Tn to their difference in visual 

` It follows, for example, that all 
in Rites figure slanted in depth will tend 

the aye.” 2M apparent frontoparallel plane 

Sence of strong cues to slant. 
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MODIFICATIONS BY THE EQUIDISTANCE 
TENDENCY OF JUDGMENTS 
INVOLVING DEPTH CUES 


Modifications of Judgments Involving 
Stereoscopic Cues 


The equidistance tendency sometimes can 
modify depth perceptions even in situations 
in which stereoscopic cues to the apparent 
depth of the objects are available. This has 
been demonstrated in a study by Gogel, 
Brune, and Inaba (1954) in which the condi- 
tions were such that the strength of the equi- 
distance tendency was near a maximum rela- 
tive to that of the stereoscopic cue. That the 
equidistance tendency can modify a stereo- 
scopic judgment under other viewing condi- 
tions is suggested by an experiment by Harker 
(1962) in which a grid was stereoscopically 
oriented at various slants. The slant of the 
grid would be expected to induce cyclorota- 
tion of the eyes resulting in a reduction in the 
apparent tilt of the grid. The equidistance 
tendency also would be expected to act in 
the direction of reducing the apparent tilt by 
tending to make all parts of the grid appear 
at the same distance. Harker found that only 
some of the reduction in perceived tilt could 
be attributed to cyclorotation with the re- 
mainder attributed to the action of the equi- 
distance tendency. Simon (1936), in a series 
of experiments, found that a binocularly ob- 
served outline circle physically slanted in 
depth would change its apparent orientation 
under reduced conditions of observation. 
Simon suggested that the perceived orienta- 
tion (and perceived distance) of such objects 
would be along an ellipsoid with the observer 
at a focal point. Clearly the observations upon 
which this conclusion was based are very 
similar to those expected from the equidis- 
tance tendency and probably represent a 
modification by the equidistance tendency of 
judgments involving stereoscopic cues, 

Several studies concerned with aftereffects 
also may support (at least inferentially) the 
notion that the equidistance tendency can 
modify stereoscopic judgments. In studies by 
Köhler and Emory (1947), Bergman and 
Gibson (1959), and Howard and Templeton 
(1964), it was found that the prior presenta- 
tion of inspection figures physically arranged 
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in depth and viewed binocularly affected the 
perceived depth involved in subsequently pre- 
sented test figures. Köhler and Emory related 
their results to satiation theory. Bergman and 
Gibson described their results as a negative 
aftereffect. A physically slanted, textured sur- 
face with either steady or moving fixation 
appeared less slanted with continued observa- 
tion. A subsequently presented surface in the 
same locality had an error in Perceived slant 
which was opposite to that found with the 
original presentation, Howard and Temple- 
ton explained their results by assuming that 
the inspection stimulus appeared less three 
dimensional than it actually was (the equi- 
distance tendency) and, consequently, that 
the subsequent stimuli appeared distorted in 
depth in a direction Opposite to that of the 
distortion in the inspection display. Since at 


d in these studies 


Modifications 


ibe of Judgments Involving Size 


1956b, Experiment IV) using the Jar 
eral separation, the equidistance tendency 
resulting from the introduction of an addi- 
1 Perceived depth 
normally occurring between the cards, It was 
found that the additional object clearly modi- 
fied the perceived depth between the cards, 


It appears likely, therefore, that the increase 
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"i 


in apparent depth with increased Tater | 
Separation which occurred in the previous oe 
periment represented a decrease in r 
strength of the equidistance tendency, 00 
increase in the strength of the size ci 
Seems that the equidistance tendency a 
modify a perceived depth even though ae 
posed by the size cues which occur from 
miliar or nonfamiliar objects. ibson 

In the study by Bergman and Gi ja 
(1959) the inspection figure consisted © 
textured surface (burlap) which was ve 
through an aperture in order to eliminate 
Perception of the edges of the surface. 
texture of the slanted surface provide 
gradient of size changes (size cues) for 
Perception of slant. There was a tendency y 
this surface when viewed either monociti 
or binocularly to appear frontal with © 
tinued observation, From the viewp?!? 
the present Paper, this result represe”? f 
modification by the equidistance tendent. ue 
the depth perception resulting from size 
alone or from size and binocular disP 
cues together, 


arity 


f 
OTHER INSTANCES op tHe Errects 0 í 
EQumistance TENDENCY PPP 

Visual conditions in which the equidist tci 
tendency would have some perceptual ox: 
Therefore, it is tO pib 
pected that phenomena which can be 4 otel 
ncy would have bee y of 
ither r implicitly in a vat’ 
Situations. This is indeed the case, aS ria 


as d 
demonstrate, Judd ” te 
ported (1898) that white rair presei at 


IS at leas 
tion and 
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Which it is viewed. Simon (1936) discussed 
Some of the early observations on perceived 
distortions which related to his own experi- 
hee and, it would seem, also relate to the 
cee tendency. Gibson and Carel 
lie N reported that a display of stationary 
ceived mn a reduced cue situation was per- 
Also A wi Occupying a frontoparallel plane. 
iens ~ results from a number of experi- 
eck i na the perception of slant 
1959. Cl: 3ibson, 1955; Bergman & Gibson, 
4 lark, 1953; Clark, Smith, & Rabe, 
> 1956a, 1956b; Gibson, 1950b; Gibson 
Smith E 1957; Gruber & Clark, 1956; 
travrian 56, 1959, 1964a, 1964b, 1964c: 
Smith ae 1945) _or curvature (Smith & 
What a 1961) indicate that, under some- 
Cutvatur uced cue situations, the slant or 
nderesti, of a Stationary surface is usually 
Serene that is, the perceived slant 
towarq i curvature of the surface tends 
that Tontoparallel plane. It will be noted 
agree hg results represent phenomena in 
that oy With the equidistance tendency in 
of the cee the tendency for all parts 
Este ace to appear equidistant. ess 
Studies i eng Park (1964), in considering 
tation volving the effect of the depth orien- 
Orient, © ground upon the apparent depth 
“There on" of the figure, concluded that 
fag. 'S Considerable evidence that the slant 
Of its we tends to be assimilated to the slant 
View oi und [p. 184].” From the point of 
ae present paper, the equidistance 
Conditi Y Is responsible for this process. Under 
of th “rs such that cues of the orientation 
wil Sure are totally reduced, the figure 
F tidis on the ground according to the 
© fleur ce tendency; that is, each part of 
‘tion Vill appear at the depth of its di- 
di uya acent portion of the ground. If 
Stance ea cues is only partial, the equi- 
ispa acy will be able to only par- 
ees the apparent slant of the figure 
10n of the apparent slant of the 
Cues as to the orientation of both 
ps Sround are reduced, both the 
šround will ter appear in a 
Pat the aS Previous discussion, it is clear 
Possible effect of the equidistance 


ation 


fig pere 
fro © and 
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tendency must be evaluated in many experi- 
mental situations involving either a complete 
or a partial reduction in depth cues. If the 
effect of the equidistance tendency is not 
adequately considered in such situations, the 
interpretation concerning the events determin- 
ing the perceptions can be seriously in error. 
Also, however, as has been noted, the equi- 
distance tendency can be useful, as well as 
being a source of distortion of experimental 
data. Harker (1962) and Howard and Tem- 
pleton (1964) have applied the equidistance 
tendency toward an explanation of binocular 
depth contrast (Werner, 1937, 1938), and 
Gogel (1964a) used the equidistance tend- 
ency in order to modify perceived position 
without varying binocular stimulation in a 
study concerned with the perception of depth 
from binocular disparity. Another possible 
application of the equidistance tendency is as 
an explanation of the ability of observers to 
make relative retinal size judgments under 
reduced cue conditions with monocular ob- 
servation and the successive or simultaneous 
presentation of stimuli (cf. Epstein, Park, & 
Casey, 1961; Hartman, 1964). Contrary to 
the discussion of this problem by Epstein, 
Park, and Casey (1961, p. 493) the equidis- 
tance tendency has been demonstrated to 
occur between simultaneously presented ob- 
jects when all, rather than only some of these 
objects, are monocularly observed (Gogel, 
1956a). Furthermore, there is evidence that 
an equidistance effect can occur between di- 
rectionally adjacent but successively presented 
stimuli. This is indicated by a study by 
Gruber, King, and Link (1963) in which a 
moon illusion was demonstrated using two 
artificial moons and the prior presentation of 
a luminous ceiling. It can be inferred from 
the ratio of the perceived sizes of the moons 
that the artificial moon which was direction- 
ally adjacent to a more distant portion of the 
ceiling, in general, appeared to be the more 
distant moon, However, even under reduced 
conditions of observation, the equidistance 
tendency may be ineffective when large direc- 
tional separations occur between the success- 
ively presented objects (cf. Rock & McDer- 
mott, 1964). 
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APPLICATIONS OF THE EQUIDISTANCE 
TENDENCY 


Application to the Cue of Relative Direction 


A monocularly observed object which is 
viewed against a surface extending in depth 
will be perceived as resting on the surface. If 
the more distant portions of the surface are 
higher in the field of view, moving the object 
upward will make the object appear to be 
more distant. This was illustrated in a figure 
by Gibson (1950a, Fig. 72) in which two 
rectangles of different sizes were made to ap- 
pear reversed in size by being placed at dif- 
ferent heights and hence appeared at different 
The same phe- 
nomenon has recently been used by McDon- 
investigation of 
d size and dis- 


On where the line 


r the bottom of the 
rectangle intersects the surface, 


It is of interest to note that if the line of 
sight to the bottom of a monocularly observed 
rectangle were higher than the most distant 
portion of a floor surface, the rectangle would 
appear (according to the equidistance tend- 
ency) to be at the more distant part of the 
scene, since the Tectangle is closest to this 
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part in direction (cf. Gibson, 1950a, Fig. o 
This observation illustrates the advantasn 
classifying the phenomenon with the er i 
tance tendency rather than with the fac Ey 
optical contact. If the surface extend 
depth were above rather than below eye 
(a ceiling rather than a floor), an nav 
movement of the rectangle would be per ip: 
as resulting in a decrease rather than urhot 
crease in distance, Similarly, if the vail 
were a left or a right wall, a lateral se i 
of the rectangle would result in ac int h 
perceived distance. Neither height the sf 
visual field nor optical contact with assit 
face adequately explains the range 0 Fest" 
Phenomena, All these phenomena, ore ct 
can be considered as part of the ao eq” 
eral perceptual event described as t ‘ 
distance tendency, in Ue 
According to evidence presented pi 
paper, in the absence of other 
visual cues an object will tend to be he ê y 
tually localized in a visual field bY ‘dis 
distance tendency. The final ed" y w 
effect in this case is the resultant ° arti 
equidistance tendencies between the obi 
lar object and all contours and omen ja 
in the visual field, with the most a ial 
contours or objects having the greater ue 
ence. Such an effect is clearly of ew 
When using a comparison field to me rd `i 
Perceived size or distance of a stand’ res"! 
perimental) 
which are du 


the 


rê 


object. Experimenta that 


Application to Emmert 

As usually ex 
that the 
directly 
surface 
tions ar 


s Law 
pressed, Emmert’s ! ima ye 
Perceived size of an afte" á 
Proportional to the distanc? de 
on which it is perceived. Ta f y 
e basic to the investigatio” nine gd 
mert’s law: (a) What factors dete!” ie 
Position of the afterimage in the vis etal 
(b) How does the Position of the a“ pt Bi 
1n the visual field determine its appar ol 
he first of these two questions 
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volves the equidistance tendency. A conveni- 
ent method of demonstrating Emmert’s law 
1S to measure (by means of a comparison 
object) the apparent size of a foveal after- 
‘mage when the observer fixates one and then 
ag of several surfaces at different dis- 
ia se One question which might be raised 
Ra ing this procedure is why the after- 
ea appears at the distance of the fixated 
askin e. This question is the same question as 
ot E why the image from a movie projector 
taice od other projector) appears at the dis- 
B 0 the screen. The answer is found in 
appesa stance tendency. The afterimage 
it fe z at the distance of the object to which 
ace o rectionally most adjacent and the sur- 
ot n which it is made to appear represents 
rection ai directional adjacency, that is, di- 
ity sh a identity. However, directional ident- 
em ould not be an essential condition for the 
i Snstration of Emmert’s law. Consider the 
rented ìn which a vertical surface is pre- 
Observer One of several distances from the 
a diet, Suppose that the observer looks in 
the foy ‘on slightly above the surface so that 
this a afterimage appears slightly above 
*eetionali in direction. If the surface is di- 
erim y the most adjacent object to the 
equi dane the afterimage, according to the 
ely ance tendency, will appear approxi- 
Surface at the distance of the surface. If the 
istane S successively presented at different 
Adjacens while maintaining its directional 
parent. with respect to the afterimage, the 
Just... $!2e of the afterimage should change, 
face > it does when it appears upon the sur- 
relate it itself, the afterimage has no cues to 
® anot to the visual field. It appears at one 
lete, -C7 distance in the visual field com- 
ten ‘tig consequence of the equidistance 
teng er i follows that the equidistance 
tion op 3, IS an essential part of any explana- 
“mmert’s law. 

Of g H Westion concerning the apparent size 
sive Afterimage has at least two possible 


ped pe might be suggested that the per- 
Artig 
Sbsep,, a t 
Bestio 


n B invariance hypothesis (cf. Ep- 
> & Casey, 1961) and thus would 


159 


be subject to the evidence for and against 
this hypothesis. The role of the equidistance 
tendency in this process would be to deter- 
mine the apparent distance by perceptually 
localizing the afterimage with respect to other 
objects. Another possible answer can be con- 
sidered by returning to the example of an 
afterimage appearing at the distance of a 
directionally adjacent (or directionally iden- 
tical) vertical surface. It has been asserted 
(Gogel, 1964b) that the perceived size of an 
object subtending a particular visual angle is 
determined by the perceived size (S’) per unit 
of visual angle (8) of objects in its apparent 
depth vicinity. The contribution of the equi- 
distance tendency to this process would be to 
make the afterimage appear at the depth vi- 
cinity of the surface and thus determine that 
the S’/@ value of the surface would become 
that of the afterimage. Clearly, for either of 
these two possible explanations of the appar- 
ent size of afterimages, the localization of the 
afterimage in the visual field by means of the 
equidistance tendency is a necessary event. 


Applications to the Moon Illusion 


Conclusions which apply to the perceived 
size and perceived distance of afterimages 
should also apply to the perceived size and 
perceived distance of the moon, and hence to 
the moon illusion (cf. Kaufman & Rock, 
1962b; King & Gruber, 1962). The moon has 
few cues to localize it in distance with re- 
spect to other objects. Therefore, as a conse- 
quence of the equidistance tendency, it should 
tend to appear at the distance of objects di- 
rectionally adjacent to it. Upon appearing at 
a particular location, the moon might appear 
to be a particular size as a consequence of 
(a) its perceived distance (the size-distance 
invariance type of explanation), or (b) its 
relative retinal size as compared with famil- 
iar objects in its perceived environment (the 
S'/0 type of explanation). Clearly, with 
either type of explanation, the equidistance 
tendency is a necessary explanatory factor, It 
follows that the perceived size of the moon 
should be capable of being systematically 
changed by using the equidistance tendency 
to make it appear at different distances or at 
the distances of objects with different values 
of S'/0. For example, suppose that an op- 
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tically generated disc of light (a moon) with 
a diameter subtending a visual angle of ~~ 
degree is presented directionally above a 3.5 
inch high playing card located at a distance 
so that its height subtends 1.75 degrees of 
visual angle. No other objects except the 
playing card and moon are visible, and all cues 
to the depth between the moon and card are 
eliminated. Under these conditions, the equi- 
distance tendency would result in the moon 
appearing at the distance of the playing card 
and according to the S'/ type of explanation 
the moon would have a perceived diameter 
of about one inch. Thus, the moon can be 
made to appear any size by having it appear 
at the distance of objects having particular 
values of S’/@ (or at particular, perceived 
distances).? It can be concluded, therefore, 
that the observation that the horizon moon 
looks larger than the zenith moon 
penstance of the particular visual conditions 
usually encountered, since the appropriate 
manipulation of the visual conditions should 
be able to destroy the illusion or reverse its 
direction, 

The above explanation of the moon illu- 
sion is consistent with the results of the re- 
cent research on the moon illusion by Kauf- 
man and Rock (1962a, 1962b), King and 
Gruber (1962), and Rock and Kaufman 
(1962) in which evidence is presented against 
the “angle-of-regard hypothesis” of the moon 
illusion summarized by Boring (1943). How- 
ever, unlike the S’/9 aspect of the explana- 
tion asserted in the present paper, Rock and 
Kaufman (1962) concluded that, in general, 
the perceived size of the moon is determined 
by its perceived distance and not by its com- 
parison with surrounding objects, The latter 
explanation is rejected because terrain modi- 
fications can affect the perceived size of the 
moon and also because the moon illusion 
seems to occur over water or desert where 
there are no other objects of Particular, per- 
ceived sizes. However, it is likely that modi- 
fications of the terrain would produce changes 
in the value of S’/@ in the visual field at the 


is a hap- 


2 Boring (1943, p. 57) probably provides an illus- 
tration of this phenomenon when he notes that the 
apparent size of the horizon moon shrinks when 


viewed between the thumb and forefinger, 
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apparent position of the moon, and it is a 
cult to imagine a visual field which con m 
no values of S'/@ but still maintains ae 
ceived distances required for the size-dis 
invariance type of explanation. ived 
However, regardless of whether pene 
size is to be understood in terms of ae 
distance or S'/4, the moon illusion, ee 
mert’s law, requires the equidistance berr, 
as part of its explanation. There is no n n 
for example, to expect modifications th 
terrain to change the perceived size T e 
moon unless there is some factor e ter- 
the moon appear at some portion of t moo! 
rain. There is no reason to expect ig sky 
to appear on the apparent dome of orizon 
(cf. King & Gruber, 1962), or for the ; ats 
moon to appear at the distance of a as to 
the horizon, unless there is some fa gons 
cause the moon to appear at these po vidi” 
Tt is asserted that this factor is the €a ent! 
tance tendency, The equidistance Kaat eld 
localizes the moon in depth in the visu@ on 
and consequently provides an essent is ip 
dition for the Operation of the pro jved 
volved in the determination of its p€" 
size, 


per 
The moon illusion is only one of & ni 
of naturally occurring perceptions to "fot 
the equidistance tendency can be apple pute! 
example, Gibson and Flock (1962) me ao 
an illusion concerning the underestim4 g. 
the distance of distant mountains t° uid” 
versal in the optical gradient. The e ane 
tance tendency provides a different oF 4 
tion of this illusion. The nearest mount 
a mountain range are localized perceP gin 
with respect to the visible terrain e*t? jon 
from the observer, The mountains an ie 
behind the nearest mountains prov a 
Cues to determine their depth position” eh | 
aS a result of the equidistance tenden” pow 
apparent distance behind the nearest | fror, 
tains, and thus their apparent distan® are 
the observer, would tend to be U? comm 
mated. It seems that the perceptu we 
quences of the equidistance tendencY well 
considered in naturally occurring 25 
in experimental situations. 


ual 
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POSSIBLE EXPLANATIONS OF THE 
EQquiistaNnce TENDENCY 


The requirements which must be met by 
A En to explain the equidistance ten- 
aia E speciñed by the range of phenomena 
point oo the tendency. From the 
planation me be the present paper, any ex- 
nomena: Ist encompass the following phe- 
ioe equidistance tendency occurs be- 
any cont y Visual inhomogeneities (between 

2. Th Ours or parts of a figure or surface). 
distance ee or effectiveness of the equi- 
Tectional pig is inversely related to di- 

Separation (in any orientation). 
tween m equidistance tendency occurs be- 
objects Saa viewed monocularly, or between 
Jects a binocularly, or between ob- 

söre | of which are viewed monocularly 

4 m € Pinocularly, 
teduce a presence of strong depth cues can 
“quidist eliminate the effectiveness of the 
i ance tendency. 
tang — certain conditions, the equidis- 
tesy] ma dency can modify depth perceptions 
dispa Ne rom cue systems such as binocular 
-mo and relative or familiar size. 

Ne equidistance tendency can act as a 
anq effect occurring between one object 
compl ; 
1961) presented an ex- 
Ot the process by which monocularly 
“ame = Objects appear at approximately the 
ne patent distance as binocularly ob- 
we i ees: This explanation asserted that 
the its, the monocularly observed object 
anu ated fad fuses binocularly with an un- 
ĉa j ut approximately corresponding 
© other eye. Since the resulting fu- 
a Will tend to be with respect to 
eq abier points, the monocularly ob- 
are dist will tend to appear at the same 
Drege © det, T as the binocularly observed 
he Writer owever, it is difficult for the 
teng Plieq ns t see how this explanation can 
er Cases in which the equidistance 


Coreg Pro 


€ 
ej Ay Gn 
itt compared between objects observed 

Cula? etely monocularly or completely 


erg, : 
tegy, Sma ; 
Sults p; n and Gibson (1959) described the 
“r study as indicating the occur- 
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rence of a negative aftereffect. The tendency 
for the slanted surface to appear in a fronto- 
parallel plane is attributed to a perceptual 
shift toward a norm and is called normaliza- 
tion. The effect of the normalization is as- 
sumed to persist over time and results, for 
example, in subsequently presented fronto- 
parallel surfaces appearing tilted in the op- 
posite direction. It is possible, therefore, that 
the concept of normalization might provide a 
basis for understanding the equidistance tend- 
ency. There are several reasons, however, for 
considering this to be unlikely. It is not ob- 
vious why the effectiveness of a normaliza- 
tion would be inversely related to directional 
separation as is required by the equidistance 
tendency. Furthermore, since the equidistance 
tendency can occur in visual fields containing 
a large number of different objects at differ- 
ent distances, an explanation in terms of nor- 
malization would require the concept of a 
large number of competing normalization 
tendencies. The range of phenomena involved 
in the equidistance tendency is such that an 
explanation of the tendency in terms of any 
simple concept of normalization is not apt 
to be successful. 

Several of the discussions of the equidis- 
tance tendency (Gogel, 1956b; Roelofs & 
Zeeman, 1957) suggest that the tendency can 
be treated as a factor possessing both magni- 
tude and direction. It would seem reasonable, 
therefore, that the equidistance tendency can 
be additive or subtractive with respect to 
other cue tendencies in the perceptual locali- 
zation of an object in apparent depth (cf. 
Gogel, Brune, & Inaba, 1954). Also, equidis- 
tance tendencies can subtract or add in de- 
termining a final equidistance vector. Thus, 
in the present paper, it is assumed that the 
equidistance tendency adds a depth factor to 
any visual field containing more than a single 
point of light, and that as a depth factor it 
enters quantitatively in competition or agree- 
ment with other depth factors in the deter- 
mination of a final apparent position of an 
object in depth. 

SOME UNSOLVED ASPECTS OF THE 
EQUIDISTANCE TENDENCY 


Tt has been reported that the tendency for 
surfaces slanted in depth to appear in a more 
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frontoparallel orientation changes with the 
amount of observation time (Bergman & Gib- 
son, 1959; Clark, 1953; Simon, 1936; Smith, 
1964). It is therefore possible that the effec- 
tiveness of the equidistance tendency (relative 
to whatever cue systems are available) changes 
with observation time. Perhaps in the absence 
of effective opposing cue systems or tend- 
encies, the perceptual consequences of the 
equidistance tendency are rapid. This is sug- 
gested by the ability of the equidistance ten- 
dency to determine the perceived path of 
movement of an object attached to the rotat- 
ing trapezoidal window with the entire display 
viewed monocularly (Gogel, 1956a). The 
rapidly changing directional alignments in the 
study with the rotating window required that 
the effect of the equidistance tendency be 
rapid in order for it to determine apparent 
depth. Without opposing cues or tendencies, 
the equidistance tendency may be quickly 
effective, When cues of depth are present, the 
relative effectiveness of the equidistance tend- 
ency (relative to that of the depth cues) pos- 
sibly increases within some interval of ob- 
servation time. Research is required to clarify 
the temporal characteristics of the equidis- 
tance phenomena both for simultaneous and 
successive presentations. 
Additional aspects of the equidistance tend- 
ency should be considered. The notion that 
the equidistance tendency between two ob- 
jects is a vector of a particular magnitude 
would suggest that increasing the number of 
objects at a constant distance would increase 
the strength of the equidistance tendency to- 
ward this distance. In a similar fashion, the 
strength of the equidistance tendency be- 
tween objects might increase with an increase 
in their visual area. Also, the meaning of the 
term directional separation requires clarifica- 
tion. Is the significant directional separation 
to be defined physically, retinally, or percep- 
tually? In most experimental situations a dis- 
tinction between the three types of definitions 
would not need to be made. However, it is 
possible to construct experimental situations 
in which the three types of directional sepa- 
ration would not produce identical results. 
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d. A number of tested and testable propo- 
social behavior emerged from a rap- 
ties, and concepts associated with open- 
e to consider the dimension of group 
-psychological research poses a question con- 


It has long been recognized by 
theorists (Hemphill, 1950) that 
ity is an essential construct wi 
group behavior. However, even 
vestigation of research reports cı 
cial behavior reveals that social s 
been largely concerned with 
membership remains constant 
tional model of social research 
morphic with regard to social systems as we 
would know them if we did not ignore or re- 
press the minatory forces of illness, death, 
and changes in group composition in general. 

Certainly a basic characteristic of social 
systems is that they maintain themselves in a 
state of perpetual change of components. It is 
true that within a given time span the compo- 
sition of the social system appears static. 
Perhaps we are deceived by our own Parallel 
movement along the time flow and the rela- 
tively steady state of the system at the molar 
level. 

Although Frenkel-Brunswik (1949) 
Heider (1958, pp. 173-194) hypothesi 
dividual differences in res 
general, theory and resear 
general response to ch 
tion are equivocal. Th 


organization 
group stabil- 
th regard to 
a cursory in- 
oncerning so- 
cientists have 
groups whose 
. This tradi- 
is scarcely iso- 


and 
zed in- 
ponse to change in 
ch with regard to a 
anges in group composi- 
e equivocation probably 
evolves from the novelty effects inherent in 
the change condition involving moderate 
stimulus intensity, and the hypothesis ( Ber- 

1 This investigation was su 
Force Contract Number AFO 
Air Force, Air Force Office 
Office of Aerospace Research, 


Pported in part by Air 
SR 62-95, United States 
of Scientific Research, 
Washington Zo), D G 


p stit 
lyne, 1960) that human beings norman o 
to maintain an intermediate amount n few A 
potential. Although there have a esis | 
tempts to generalize Berlyne’s ae i y 
conditions of group member homeo sa st 
hypothesis suggests higher member ie De 
tion is associated with a modicum es " 
ship change. Although the intensi eult A 
membership change stimulus is memb”, 
define, it will be assumed here tiaw e v 
ship change usually results in mer sti yf 
arousal potential. Assuming high m ers 
intensity under conditions of me achin 
change, it is proposed that in app! | ay 
d 


ra 
ynamic environment involving an ® 1 
familiar and unfamiliar social objects: nd jn 
sion level of a cognitive organis™ ve # 
increase as a consequence of negn u 

forcement in Previously encountere n a y 
tured situations, In addition, an n o 
situation poses a threat to individ jen” 
ity. A confrontation of the self may, n d 
a reevaluation of the self, which, and | 
demands a redistribution of activity dt! 
ditional energy expenditure, Thus, ¥ eral i 
Simon (1958) Propose that “in gen p“ 0s 
up to a fairly extreme point, increas® for “i 
ability yields increased satisfaction 5 vio! 
People [p. 94].” Becker (1950, P- osi ad 
Ports this latter viewpoint. His Oe the ip’ 
described the hostility which atten? jo", 


stable ? es 
vent of a newcomer to relatively stä? cyc? 
Since the newcomer represents pe ef 
change. 


n 
mo 
On the other hand, at the com 
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level of psychology, a number of assumptions 
ave been made concerning the positive as- 
Pects of membership change. It is said, for 
example, that newly commissioned officers in 
military units in the 1930 decade were re- 
oe frequently in order to avoid constant 
i of initial failures. Highly capable 
ie and members of professional baseball 
on sige disappointing records are replaced, 
fi need for ‘new blood” or a new en- 
Hieni a 1s offered in mythical explanation. 
hes: oq is widespread as to the number of 
safely at a given group can profitably or 
“a retain the same leader (McVey & 
ere poet Finally, changes in personnel 
and oth ane of embezzlers. In general, these 
group er speculative change strategies in 
theor ey derive from an implicit 
a open-closed groups. ; 
Cernin i the first systematic analysis con- 
fuid 4 e open-closed group phenomenon is 
1950) m the writings of Simmel (Wolff, 
sively o eecneally, Simmel speculated exten- 
Oncerning the unique characteristics 
munity Sager in an otherwise stable com- 
iscussed Che nucleus of the characteristics 
ility ea composed of the concepts of mo- 
objectivity. “The fundamentally 


mobi 

il 

or „© Person comes in contact, at one time 
another 


gle one [p. 404].” Moreover, the 
is not radically committed to the 
tedients and peculiar tendencies of 
specif, and therefore approaches them with 
T A attitude of ‘objectivity’ [p. 404]. 

the ¢ AT Simmel was concerned largely with 
Tents ecteristics of the stranger, the argu- 
do op aso have implications for groups which 
iS, o ne hot have contact with strangers, that 
Broup ¢ and closed groups. The open-closed 
analogoa Pt derives rather broadly and only 
fta — from a general systems theory 
OF . (Bertalanffy, 1950). A basic unit 
“set” 8PProach is a “system” defined as a 
Ip, 23 | » elements standing in interaction 
May be Systems in this very general sense 
\ Closeq tesorized as either open or closed. 
Ment. | S¥Stem is isolated from the environ- 
Pature o elements are excluded and the de- 

Ol elements is precluded. It is pro- 


the > OUP, 
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posed that most living organisms are initially 
open systems, since they are maintained “in a 
continuous inflow and outflow, building up 
and breaking down of components, neither be- 
ing, as long as it is alive, in a state of . . . 
equilibrium but maintained in a so-called 
steady state . . . [Bertalanffy, p. 23].” 

Extending this framework to social systems, 
an open group may be defined as an interact- 
ing set of persons in a continuous state of 
membership flux; for example, an Army unit 
under combat conditions. Under these condi- 
tions of social metabolism, the elements of 
the complex are temporarily interrelated 
owing to periodic or frequent but unscheduled 
replacements, removals, or additions of ele- 
ments. In closed groups, on the other hand, 
the elemental composition remains constant. 
Polaris submarine crews, for example, remain 
submerged for three months without even 
radio contact with the outside world. 

Four fundamental characteristics appear to 
distinguish open groups from closed groups: a 
reduced time perspective; the achievement 
and maintenance of steady states (equilib- 
rium); an expanded frame of reference; and, 
of course, changing group membership. Under 
the heading of changing group membership, 
we shall be concerned with the topics of mo- 
bility and power, assimilation process of the 
newcomer, and leadership change. 


TIME PERSPECTIVE 


By definition the lives of the members of 
closed groups are inextricably intermingled 
for a given period of time. Conversely, mem- 
bers of open groups are aware of the transi- 
tory nature of these relationships. The future 
is indistinct; the present dominates. Imple- 
mentation of group decisions tends to be acti- 
vated with greater haste. If the present mem- 
bership is to benefit from a given group ac- 
tion, the action must be undertaken in the 
immediate future. Postponement may be tan- 
tamount to inaction for those members who 
may be removed or replaced. 

Sorokin (1927) states that contemporary 
Western society demands a reduced sensi- 
tivity to the environment. He contends 
(p. 518) that our nervous system would be 
wrecked completely if it were sensitive to 
the innumerable phenomena which surround 
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us in our permanent mobility. At least two 
alternatives are possible, group orientation or 
self-orientation. : 

Group structure growing from the members 
attachment to the group as a group rather 
than to unique relationships with individual 
members may insure a degree of stability in 
an otherwise changing social environment. 
For example, in order to insure the mainte- 
nance of the norm of reciprocity within an 
open group, loyalty to the group must be 
assumed. Reciprocity is defined (Gouldner, 
1960) as the “mutually contingent exchange 
of gratification,” and is assumed to be a uni- 
versal norm. Minimally, reciprocity demands 
that: (a) people should help those who have 
helped them and (4) people should not injure 
those who have helped them. In Open groups, 
however, a system of sanctions may be diffi- 
cult to maintain. Thus a beneficent member 
of the group may fail to be recompensed by 
another member who departs before compen- 
sation is possible. It is Proposed that in open 
groups interpersonal interdependency sur- 
renders to Person-group 
Members bestow fa: 
the name of the gr 
larger organization 


of which it is a Part is 
obligated to assume 


the responsibility of re- 
ciprocation. Herein lies the concept of loyalty 
and its rewards. In closed groups, then, the 
reciprocity norm functions among individuals; 
in open groups, the reciprocity norm func- 
tions between the individual and the group 
as a group. 
Self-orientation ma: 


y also provide a rela- 
tively stable basis in 


an otherwise changing 
social field. Thus, in anticipation of being 


forced to change groups, the members may 
become primarily concerned with transferrable 
acquisitions, that is, returns or benefits ad- 
vantageous primarily to the single mobile 
member. 

To counteract the Possibility of this short- 
term perspective, the group may invoke the 
concept of a universal group or a universal 
philosophy or religion to which all men belong 
or adhere. In this way, intergroup mobility be- 
comes intragroup mobility and a degree of 
stability is achieved by the system. 

Consideration of time Perspective in open 
and closed groups has implications for formal 
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a 
game theory. Thus, it is proposed be 
player does not select a strategy to m ups 
individual gains as readily in closed ra i 
as in open groups since extra-game A 
an interpersonal nature may be involved. 


EQUILIBRIUM 


sch assot! 
The theoretical framework which 


ates open-closed groups and ag ree 
sketched by Chappel and Coon (1942). ait 
A state of equilibrium may be defined as eee or 
a small force is impressed upon a system, 4 P nce 
adjustment takes place within the system, rox” 
that force is removed, the system returns t0 
imately its previous state [p. 43]. mber 
One such force is a change in group Mê 
ship. inst? 
It has been repeatedly suggested that ! a 
bility is a basic shortcoming of an open i tune 
system, Changing leadership at an TODE dde” 
time, a rapid turnover of personnel, tity? 
exodus of members, a loss of group ee add 
concern for sources of replacements = s cial 
tions, and the unpredictability of futu” inbhe 
contacts are just a few of the problem yed 
ently involved in open groups. It 15 sprit! y 
here that a constant state of disequili a that 
nonfunctional. Thus, it is hypothes!? rott 
the members of open groups develop ects , 
dures which minimize the disruptive © ate? 
membership changes and establish a tio” 


. se . r 
dynamic equilibrium through consi “A; and 
of group size, homogeneity of person” 
turnover rate, 

Group Size or? 

It is proposed that group size iS, P ose 
critical variable in open groups than 1? i 
groups, particularly with regard to vers if 


groups. Tf the group is composed of 0P zov 
persons, the departure of one of these we 
the group. Even with three members, PO es 
there remains a substantial] probabilit ar 
two members may leave simultaneous?’ tji 
ticularly in view of a possible concaté 
effect. Since the entire organization Ye 
threatened by the loss of a member e 
small group than a large group, the pee e 
of small groups may overreact to the a a 
Sroup disintegration and leave the Brn iti. 
maturely, Moreover, under these CO? mo” 

undue power is inherently accorded me 
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Individual, since he possesses the power to 
destroy the group. Group size is a critical 
consideration in terms of group stability from 
thet point of view. The probability of 
pe ility between the new member with at 
tag other member is increased with an 
roces ane group size. Thus, the assimilation 
reame "9 be presumed to be facilitated by 
thereby S the number in the host group, 
Wie we likelihood of the new- 

ente T in the group. If, however, 
group si ership is homogeneous, the effects of 

ze are attenuated. 

i oo membership heterogeneity, as 
license the group size, is presumed to 
and t rites Probability of matching members 
Stability y increase compatibility and group 

eterogen under open group conditions. A 
Clitates eous grouping of personnel also fa- 
tensive ne stability by providing a more 
an ade Pool of replacements which insures 

uate flow of personnel into the unit. 


T 
Urnoyep Ra te 


To 
arity (1960) assumes that “turnover nec- 


Workin Isrupts informal understandings and 
: coord angements” and therefore affects 
discussi nates of the group’s activities and 
tails, 00S. Moreover, turnover usually en- 

Dither of part of the group’s “memory.” 
May re Sing the rate of turnover presumably 
lems “ the height where personnel prob- 
toup, t Precedence over productivity. Open 
“en, are required to conduct member- 
NUD tive wees at a rate which is minimally dis- 
blog. | One means is to make changes in a 
(Tow ect than extend the transition period 
bat eoa However, under military com- 
block o tions, the replacement of a large 
fidence Personnel may interfere with the con- 
ay me the remainder of the organization 
Stregs (,'Mterfere with its effectiveness under 
Of tur o Onymous, 1946). The optimum rate 


tage Mover appe i 
a tion of the 

demands pears to be a functio 

M 


relat; a 

Member tlonship between turnover rate and 
ae Grog 'P satisfaction was reported by Dar- 
j Bitls ©, nd Martin (1951). Satisfaction 
with e own dormitory life was 
ated negatively to the stability 
Pinay tory population. 

> à relationship between growth rate 


167 


and institutional strength is hypothesized by 
Chapin (1957). On the basis of a comparison 
of eight old and eight young churches, it was 
hypothesized that weak institutions are a re- 
sult of disintegration from burgeoning group 
size or lack of new blood. 

Tt is also assumed that one of the dangers 
of membership change is oscillation due to an 
overreaction to change. In order to counteract 
this tendency and to maintain efficiency dur- 
ing readjustment, open systems may develop 
a lag or decrease in sensitivity to change 
(Herbst, 1954, p. 333). Lag is the amount of 
time required before changes in one com- 
ponent are communicated or transferred to 
other components. One concrete means of 
building a lag into the system is to suggest 
that the host group members react to the posi- 
tion to which the new member has been as- 
signed rather than to his characteristics as a 
unique individual. Again, military replace- 
ments are an excellent example, since the posi- 
tion of the individual in these units is clearly 
depicted with appropriate symbols promi- 
nently displayed on the uniform. 

One of the inherent problems of open 
groups is the maintenance of a flow of per- 
sonnel in and out of the system, yet main- 
taining the steady state of the system. Here, 
age heterogeneity is necessary in order to 
avoid mass replacements as the members ad- 
vance in age. 

Assistance in making a successful and satis- 
fying transition to another group also con- 
tributes to group stability by reducing the 
tendency for former members to derogate 
the donor group (Caplow & McGee, 1958). 

Another empirical development for main- 
taining group stability among open groups 
such as colleges, military service units, or 
political groups is the organization of former 
members as in alumni clubs, veterans’ or- 
ganizations, political parties, and perhaps the 
‘Actors Guild. The functions of these latter 
more stable organizations are manifold but 
include maintaining the flow of personnel 
in and out of the organization which they 
serve in an auxiliary capacity and, perhaps, 
acting as an extended reward system within 
which the parent organization can exercise 
sanctions toward a transgressor who has left 
the organization by precluding membership in 
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the graduate organization. With these gradu- 
ate groups, then, a former member, in effect, 
can never escape the influence of the parent 
organization. In this respect, the open group 
assumes some of the characteristics of a closed 
group. Groups in which careers are short- 
lived or hazardous appear to demand the 
services of the extended group as a means of 
system stability. 


FRAME OF REFERENCE 


The term “frame of reference” commonly 
refers to the ground against which the figure 
is perceived (Newcomb, 1950, p. 94). We 
shall use the broader definition of the term 
which includes all the factors, objective and 
subjective, which influence the perception of 
the figure. 

It is proposed that open groups, in contrast 
to closed groups, have an expanded frame of 
reference (Redfield, 1947; Sorokin, 1927), 
These sociologists aver that members of an 
immobile society learn, as a rule, only a def- 
nite course of ideas, opinions, beliefs, and 
values. They have little opportunity to listen 
to many ideologies and learn different beliefs: 
and, as a consequence, their 
are inflexible. 

Several advantages inhere in the use of open 
groups or, in effect, a larger population as a 
frame of reference for status evaluations 
rather than a fortuitiously selected small 
closed group. In the expanded frame of ref- 
erence of the open groups in which transfers 
frequently occur, more accurate and more re- 
liable ratings of the members are possible, 
since all members tend to be exposed to paired 
comparisons with a wide variety of others. 
Similarly, more accurate self-concepts are en- 
couraged within open systems. Since it is less 
possible to insulate and isolate oneself, in- 
flated self-estimates do not remain unchecked 
or unchallenged for long: nor are erroneous 
self estimates buttressed for long by peculiar 
status achievements within a given small so- 
cial unit. 

These same arguments may be applied to 
groups. Thus, in open group systems, islands 
of cultural or organizational lag are less prob- 
able than in closed groups (see Redfield, 
1953). The proposition concerning an eyx- 

panded frame of reference in open groups also 
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NET 
has implications for group creativity 
change in group set. 


Creativity 


held thet 
Many social scientists have eee: 
creativity flourishes in societies where mbers 
a continuous flow of new exogenous cietie 
(open societies), and is curbed sd usual 
from which strangers are restricted. 1 will! 
rationale is that isolation results nd ral 
specialized conceptual frameworks : deem 
limitations on the kind of behavior edel 
acceptable in a given situation ( ropo? 
1947). Sorokin (1927), however, Pa with 
that in horizontally immobile societ! viol 
their “permanent and monotonous vention 
ment, there is little incentive for iV ditio™® 
and no favorable combination of anes 
which may suggest the ideas of ek othe” 
Empirical evidence supporting this = b 
sis involves research teams in indiski ye 
oratories (Shepard, 1955), The i 
gest that group productivity, a engt 
enthusiasms are negatively related d n n 
of association of the group member, tas 
laboratory study in which the creativ? for 3 
involved the composition of aptori whic 
Saturday Evening Post cartoon, grouP® ope 
experienced membership changes grou 
groups) were more creative than staP , G00" 
(closed groups) (Ziller, Behringer, i 
childs, 1962), ME 
The latter results were interpreted one Po 
of social facilitation (Allport, 1924): 
formances of the host and donor group’ 
e compared by the newcomer, whe 
judgments of the closed group’s pe! A 
is necessarily absolute. Thus intergroh pel, 
parison or competition may be the 1 emb? 
ate variable relating changing group 
ship and creativity, 
Change in Group Set j ou? 
: 0 
It is hypothesized that a change a aok 
composition facilitates a change in m e kp 
set toward group outcomes. For eami n 
Proposed that a failure atmosphere on a 
tudes constituting low group evalua me 
owing group failure are dissociato erm” 
readily if membership is altered. Furl oti! j 
it may be proposed that the eee 
group continuity provides a conve”! 
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for dividing the group history into two time 
ie Ba past and the future—and for 
deed pes | failure with the past. In- 
a » totowing failure, a change of any kind 
t a a plausible basis for dissociating 
eal” on and perceiving the future as a “new 
obtrusive new group era relatively free from 
(New A of past unpleasantries 
se is effic s Eve phenomenon). Change per 
changes in Any one of a variety of 
convenient as a rallying point or a 
erapeuti ense mechanism which serves a 
heed unction for groups embarrassed 
les of failure. 

in cee hand, it may be proposed that 
minimize i a history of success attempt to 
identity +3 Hae in order to preserve the 
ceptions mi e group and the members’ per- 
ning const the nebulous but accepted win- 
embers ore It is difficult for group 
Atists to hes as for professional social sci- 
‘hus, in ee ine the basis for group success. 
'cted that gt of group success, it is pre- 

every q he group members tend to adhere 
Pattern of etail to the previously successful 
Possible group behavior. Since this is 1m- 
hange under conditions of membership 
tially a change may be perceived as poten- 
rather o_o or as a threat to the group’s 
cessful Nponderable but, nevertheless, suc- 

Pattern of behavior. 


aane Group MEMBERSHIP 
tinguishes obvious characteristic that dis- 
th tress Open groups from closed groups is 
Pe Senn change in group composition. In 
8 re Men members may be added, removed, 
Posite ae Replacement, however, is a com- 
not pe yt Temoval and an addition so it need 
Geo separately. 
à E ath group composition may be stra- 
Etoup 4 erely in response to the exigencies of 
i Blam, We shall be primarily concerned 
Atro Uc Ned change. The entire section 1S 
Consideran y a discussion of power, a basic 
” gto tion in relation to planned changes 
P composition 
Owe, $ 


Ewin 
er a a tasi) has defined the power of b 
Ma towa the maximum force that b can put 
Td some region relative to the maxi- 
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mum resistance that @ can mobilize against 
b's force. Cartwright (1953) has used this 
concept to describe some of the consequences 
of power in relatively permanent groups or, as 
used here, in closed groups. He notes that a 
higher degree of power of b over a carries po- 
tential, whether or not b has ever used his 
power. In closed groups, b's power may be 
exercised either in the present or in the future. 

Thus, in closed groups, any act between a 
and b potentially has consequences for the 
power relationship between @ and b in the 
future. On the other hand, in open groups, 
either a or b may leave the group voluntarily. 
This contingency tends to reduce the power 
aspects of the relationship since the length 
of time is reduced over which either person is 
able to impose sanctions on the other. The 
person toward whom sanctions are being di- 
rected may be removed from the group and 
the other person’s sphere of influence; thereby, 
in Lewin’s terms, increasing the resistance 
that the person can mobilize against the 
other’s force by withdrawing from the field. 
Of course, to counteract the possible reduc- 
tion in length of time in which something may 
be exacted in open groups, the sanctions may 
become more severe and immediate in open 
groups. 


Power and the Newcomer 


Some particularly unique power considera- 
tions arise with regard to the newcomer and 
imminent egressor under open-group condi- 
tions. By definition, the newcomer has re- 
cently been a member of another group, often 
a group quite similar to the host group. At 
the very least, then, the newcomer possesses 
a more extended frame of reference than 
those members of the host group who have 
worked exclusively within the boundaries of 
the host group. In terms of interpersonal eval- 
uation, the newcomer is in a position to make 
intragroup and intergroup comparisons among 
the group members. Similarly, the newcomer 
is in a position to render invidious compari- 
sons between the host and donor groups; but, 
of course, the group is in a similar position if 
the member is a replacement (Gouldner, 
1954). 

Various subgroups among the main body 
may become concerned as to in which sub- 
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group the newcomer will cast his lot, for by 
becoming aligned with one or the other of the 
various informal and sometimes competitive 
or even hostile subgroups of an organization, 
the equilibrium of forces within the host 
group may be threatened. Thus, the allegiance 
of the newcomer may be actively solicited by 
each of the subgroups, and the newcomer, 
through his position as an uncommitted mem- 
ber and perhaps a pivotal member, is accorded 
a strategic power role in the group (Snyder, 
1958), 

Moreover, it has demonstrated 
(Wright, 1942) that the newcomer tends to 
be regarded as a guest rather than as a regu- 
ase of adjust- 
guest being to 
wcomer is in a 
mbers may find 
‘Wwcomer’s wishes, 
depart unexpectedly soon 


, the host group would be 
somewhat embarrassed publicly, 


a variety of ways. 
is related to power 
Once the newe 


in these terms is resorti 
host group members are Concerned, to rela- 
tively random behavior. The older members 
are themselves more Predictable and, in fact 
are constrained to be in relation to their 
position and role in the group. Thus, the 
regular members are more vulnerable to vari- 
ous game strategies than the newcomer. 

In general, some of the Power advantages 
of the newcomer have been outlined. Con- 
versely, various negative pow 


ng, so far as the 


er components 
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the informal group structure, cautious bee 
ior to preclude premature and oan 
evaluations, unfamiliarity with group oe 
and inability to check the validity of pe 
nel evaluations with other members. 


Power and the Egressor 


š t 
Another role peculiar to open groups i 
of the egressor, the member whose deP? it 
is imminent. In contrast to the newcomels sy 
hypothesized that the power of the oa 
diminishes directly in relation to the 1 wel f 
ing membership time. Group tenure and Fer 
are directly related. In Lewin’s (1951) rt 0 
again, since the egressor (E) will deps ae 


Í 
are inherent in the role: lack of knowledge ° 


a given date, the group’s ability to ize 
forces emanating from Æ may be er the 
through certain strategies. For Gank ot 
&roup may delay reaching a ee de 
acting upon a decision until after n th 
parture. E may even be omitted fr” 
decision-making group. ri 
By the same token, however, Æ oe, 
forces emanating from the group. T uct! 
military circles the phenomenon of jstan 
Motivation to action or increased re! riod 
to group pressure as the enlistment acking 
draws to an end is referred to as S% js? 
arms” or “going casual.” Moreovely 2 
potential threat to the donor orga” mt 
Since his loyalties are indeterminate, ® 
ginal man as it were, E may be free t° jipoi! 
negative aspects of the donor group 
fear of sanctions. 
Although these Propositions até m 
concerned with individual group w 3H 
they may also be extended to the 8?° tu 
groups. For example, a small manuta 
unit may encounter fewer problems 0 t o 
relations since the pare at? 


ped 
ay abandon the operatio pul 


esist 


mobility of the Magt 1 
Presents a departure t e 0% 


mobi fora ata FOND Which 7 pos 
later. 
Power, t 

The power of E is Contingent upo” sacle 
ted g 
mpe” 


variety of reasons to be : pa” 
hus again mobility may t 


two other qualifications: (g) the pe 
istics of the group which he is esp? 
Join and (5) the probability of a rep! 


TOWARD A THEORY OF OPEN AND CLOSED GROUPS 


If E is to be replaced, it behooves the group 
to react to Æ in terms of the position occupied 
nig adjustment problems during the transi- 
ae Period will be minimized if the position 
mains active. If, on the other hand, the 
eee to be eliminated from the organiza- 
ie Structure, E’s power will be minimized 
ng the final phases of transition. 
š a a of E’s host groups must also 
poiret T ered in relation to E’s penultimate 
is directi may be hypothesized that Z’s power 
group i y related to the power of the host 
tapid an relation to the donor group. The 
ite of the power of the member 
retiring Ing retirement is relevant. Since the 
Sites D usually does not rejoin an 
organizati social unit, power derived from 
A ie ‘onal affiliation is withdrawn. On the 
area bere if E is to join another group whose 
ig er uence extends to the donor group, 
Same ee Probably does not decline to the 
e hepi or rate. In this regard, it would 
and of p to deduce the power path of leaders 
tetirement group members as they approach 
Path dee 1 For example, assuming the power 
first ob, Clerates, at what point is the change 
What į Served by the members and E, and 
s e€ nature of the deceleration curve? 
that ES general sense, then, it is proposed 
relateq “og mobility and past mobility are 
Soup Th the power of an individual in the 
by son 2E nature of the power is qualified 


Som x 
a € of the conditions discussed here. 


e D 
Cbarture Alternative in Open Groups 


Mental S already been indicated, a funda- 
Characteristic of open groups is that 
of the ol members into and from the core 
Possible Tr lex is expected, happens, or is 
We ara | t ÎS the last qualification with which 
i concerned in this section. 
tiong „© Bross level of abstraction, organiza- 
Case g aY be differentiated with respect to the 
Case È Ina sense, this is merely a special 
lasite “mphasis of the general open group 
Sherga Ahon ; but a number of ramifications 
attit. hich are relatively specific to this 


m 
i leave ations which are relatively difficult 
a insti CC Teadily recalled: a prison, men- 
NY ato Utions in some cases, hospitals, mili- 
“PS, an island community or isolated 
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community in general, and perhaps some na- 
tions. What, now, are some of the social- 
psychological consequences of barriers to 
departure or ease of departure? 

Here again, we will invoke a decision- 
making framework. In general terms, the 
option to leave the group presents an ex- 
panded range of alternatives to the decision- 
maker; the individual can withdraw member- 
ship. While relatively little research has been 
concerned with the range of alternatives in 
a decision-making situation a number of 
hypotheses are suggested. 

As the number of alternatives are increased, 
in a decision-making situation, the degree of 
conflict is intensified. Thus, the introduction 
of the departure alternative increases the 
degree of conflict for the group members. 

March and Simon (1958) have speculated 
that “the greater the number of perceived 
alternatives to participation available in the 
external environment, the less important the 
consequences associated with variations in 
conformity to organizational demands [p. 
58].” For example, they hypothesize that 
the greater the number of unemployed 
workers the fewer the perceived alternatives 
to participation. 

It has also been hypothesized (Ziller, 
1955) that an expanded range of alternatives 
on scale of judgment is associated more fre- 
quently with a decision of superior quality. 
The rationale for this proposition rests on 
the assumption that the probability is greater 
that the superior decision will be included 
in a relatively heterogeneous and expanded 
range of alternatives. In terms of the decision 
of group membership, it may be hypothesized 
that an increase in the number of alternatives 
available to the individual will lead to a 
superior decision and perhaps to a more 
personally satisfactory decision. 

Four variations in the initiation of change 
must be considered: (a) only the individual 
member may initiate change in group mem- 
bership and group approval is not required, 
(b) only the group may initiate change pro- 
ceedings for the individual, (c) either a or b, 
(d) both the individual and the group must 
agree on change proceedings. It is immedi- 
ately recognized that these four conditions 
describe the focus of power for initiating 
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membership change. Condition d probably 
requires an arbiter. 

Given Condition a, the individual member 
may be expected to retain group membership 
as long as his needs are satisfied to a greater 
degree by that group than by any other group 
in which membership is available. The mem- 
ber is in a buyer’s market, as it were. More- 
over, it is difficult to impose sanctions on 
group members under these conditions, As- 
suming that membership in similar groups 
is available, the individual is free to with- 
draw membership or retain membership but 
withdraw from active group participation. 
Moreover, if additions are precluded and 
replacement is the only Possible means of 
membership change, the power of the mem- 
bers under these conditions is enhanced. 

The power of the individual under Condi- 
tion @ derives from several sources. If the 
individual has assumed a position and role 
of responsibility in the group, the group has 
become, by definition, dependent upon this 
member, to a degree. Thus, if a responsible 
member of the group considers removing his 
skills from the group, the state of equilibrium 
achieved by the group is threatened. Of 
Course, the loss to the group is in Proportion 
to the difficulty of finding a replacement. 
However, the member remains in 
the group, the Sreater the group's dependence 
on the member, if only in the way of group 


atic interdependencies 


departure of other members may be precipi 


fer- 
ring membership to another group and adjust- 
in the immedi- 
rences between 


membership to 
another group and adjustment in a relatively 


Finally, the power of the members of groups 
under Condition a derives from the threat to 
the group image. When a member leaves 
under Condition a, the departing member 
communicates, in effect, to the other members 
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and other potential members that the goa 
did not satisfy his needs and, by genera a 
tion, may not satisfy the needs of labke 
members to the same extent as other availa 
roups. i 
: Under the Condition 6 of members 
change, the group may be said to retain indi- 
member for that period of time that te atter 
vidual satisfies the needs of the group t th 
than any other potential member and 4 t 
lowest cost to the group. Here, le e! 
group is in a buyer’s market; departure uacy 
Condition & connotes individual ieden es 
in contrast to Condition @ which ection 
group inadequacy, Thus, powerful pune: 
are at the disposal of the group, and er : 
pressure toward conformity may be T tores 
This condition may be observed in CU ae 
where only the male can initiate divorc iads 
ceedings, in a military setting during Freu 
of conscription, or, in general, under z ovel 
stances where the group holds preceden¢ 
the individual, ature 
Under Condition a, the cost of ee the 
is a Primary concern of the group an 
individual may be assumed to weigh ee 
in his decision to leave. Similarly, a pri 
Condition b, the cost of departure }§ condi 
mary concern of the individual. Under je 008" 
tion c, a game theory model obtains. Ta are 
and rewards to the group and indivi “ai be 
under constant assessment; and it oe of 
Predicted that a considerable amon se 
energy is expended by the parties t° ain 
their relative Positions in order to ee no! 
a bargaining position. Neither the erin c0 
the individual can afford to relax the! othe 
tinuous Surveillance procedures; thé 
may be Probing for an advantage. mot 
Under Condition g the game theory s ie 
still applies, but the decision in t@™ py 


cos 


Costs and rewards 1S usually arrive wit 
means of arbitration with at least owi 
party. By this individua te” 


f means, t ed 
strategies are circumvented or at leas ure al? 
_ Each of these Conditions for depart g ont 
1s related to conditions for joining the EP 
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tion a since a greater degree of personal 
commitment is involved. 
aly, these same departure conditions 
i related to techniques of personnel selec- 
a a fronp which can discard personnel 
Ís lees ata impunity and minimum difficulty 
ee ides The be concerned with selection 
4 Pies ' he on-the-job situation provides 
citeumsta nt selection criterion under some 
Nees, 
group reion of a member to leave the 
an extensi avor of another group may have 
ree aloe impact on the group from yet 
è eten points of view. As is indicated by 
leave 7 & phrase, favor of, the decision to 
as unfla group of origin may be perceived 


g to ea to the donor group and flatter- 


19 58), 
i However, w 


e host group (Caplow & McGee, 


inher hen a colleague leaves the group 
bers ently enlarges the sphere of the mem- 
the original group to include some 
of the emigree’s new group. The 
derogation of the donor group 
“parture and the enlargement of the 
Pattures a ld may tend to activate other de- 
Severn) OM the initial group, particularly if 
Members leave simultaneously thereby 
B an impression of down mobility in 
group. 
the departure of a member, and 
the leader, may precipitate a 
n of changes in the organization 
to ingg OrSanization may be less reluctant 
lon aie changes. Ideas and complaints 
Uminisp an t in the minds of members and 
è te Eea ation may now find expression. Since 
Sar ation is required in any case with 
Sng group membership, a thorough 
May be facilitated. 


metin 
© origina] 


Since €Ntatio 


"Ate 
R y 
p Y Of Personnel Changes 


Toh 
er $ : n 
Teme an associated with changing group 


ave probably been recognized 
es. The adoption of the dynasty 
Primogeniture principle to the succes- 
ene Was undoubtedly a response to 
ia “ss of succession difficulties. : 

velli (circa 1532), in The Prince, 
the problem and described a 
n Whereby the Prince succeeded to 
of a newly conquered kingdom, 


Sion af 
“a awar 
Mach 


ecg, . 
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enforced drastic changes in government, and 
yet won the everlasting loyalty and gratitude 
of the subjugated people. To accomplish this, 
a myrmidon was frst assigned as ruler by 
the Prince. This subordinate effected with dis- 
patch and utter ruthlessness the changes 
deemed necessary by the Prince. When all had 
been accomplished to the Prince’s satisfac- 
tion, the Prince entered the city, convoked 
the people along with the myrmidon, and 
slew the scoundrel before all as an act of 
justice and atonement—a social gambit, as 
it were. 

The ubiquitous nature of the personnel 
change problem finds expression in the gnome, 
“Never change horses in the middle of a 
stream.” Finally, an old English sage has 
advised, “Never marry a widow unless her 
husband was hanged.” It is curious to note 
that the latter bit of wisdom is supported by 
Gouldner (1954) who refers to the tendency 
to compare the replacement unfavorably with 
the departed member as the “Rebecca Myth.” 
The tendency for widows to exaggerate the 
virility of their late husbands (Bernard, 
1956) may also be cited in support of this 
last hypothesis. 

Thus a more systematic analysis of some 
of these ancient bits of wisdom may prove 
productive. The dynasty and primogeniture 
methods of succession immediately suggest, 
for example, a decision-making model in 
which an automatic decision-making device is 
used. By imposing an automatic decision- 
making device, internecine affairs are avoided, 
and the state is not vulnerable during an 
unduly protracted period of interregnum. 
Thus, in contrast to a dictatorial form of 
government, democracies have institutional- 
ized an automatic succession procedure which 
presumably insures a more stable organization 
over time (Brezezinski & Huntington, 1964, 
pp. 89-109). Other automatic succession pro- 
cedures include: promotion in accord with 
longevity or age, the exclusion from considera- 
tion of members of the group because of such 
visible characteristics as sex or race, and 
random selection by lots. 

As was suggested earlier, change per se is 
hypothesized to be disliked by some indi- 
viduals under most circumstances. In addition 
to the explanations proffered earlier, specu- 
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latively, a change in personnel may be per- 
ceived unfavorably in general because it re- 
minds one that everyone, including self, is 
replaceable. The generalized antichange dis- 
position may be displaced toward the new- 
comer, the characteristics of the newcomer 
notwithstanding. However, if the replacement 
is delayed (a period of mourning, as it were, 
in which it is taboo to replace the departed 
member) the recognition of the need for a 
replacement may surmount the antipathy 
toward changes in personnel and counteract 
the so-called “Rebecca Myth” phenomenon. 
In general, it is hypothesized that a new- 
comer will be assimilated more teadily by 
recognized need 
may be assumed, 


N rs of successful 
groups, in contrast to those in groups that 


of sharing group rewards, and the group’s 
o its environment. 


an agent or cat: 
where change i 
1960). 

It will be assumed that there js a tendency 
to compare the newcomer invidiously with the 
the former member. It js also quite con- 
ceivable that if the newcomer is more favor- 
ably treated than the departed member, the 
assimilation Process may be further retarded 
and the change situation further exacerbated, 
In open groups the preferential treatment of 
the latecomer may be interpreted as evidence 
that a premium is placed on mobility rather 
than loyalty, 

Still, in some respects, the replacement’s 
role is preferable to that of the addition: The 
replacement occupies a Position vacated by 
his predecessor, a position to which com- 

munication channels haye been previously 
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established. For the addition, the posit! 
must be defined and established. ement 
More simply, perhaps, the oe 7 
represents a less abrupt change both ie 
newcomer and the group. The position fort! 
ment permits the new member to pê idiv 
various group functions and accumulate i 
syncratic credits (Hollander, 1958) t fol 
allow for greater individual adjusts 
lowing the initial orientation phases e; 
assimilation process. Stated otherwis mer 0 
Position assignment enables the n eee nl 
focus on the task demands without pfus" 
complicating the situation with a 00 ghid 
array of interpersonal considerations i 
can only be resolved over time. facili 
Membership transfer may also ne may 
tated by avoiding an abrupt exodus. complet’ 
involve the provision of time to ibilt 
ongoing activities or transfer resp a 
for these activities to other members nden? 
donor group. A period of “indep? 
training” may also be necessary. 
The abruptness of change may als? 
liated by increasing the number 0 an 
between the prospective new member si H 
host group and simultaneously decre? ou?! 
number of contacts with the donor s0% 
This technique has been advocated, j whe 
case workers in an adoption situati? fost? 
the child is initially under the care 0 
parent (Gerurd & Dukette, 1954). pE 
Similarly, in order to reduce the aD! itv 
? I 
of change (“cultural shock”) from, 
tional life to civilian life, “half-way | av 
have been developed for persons W ; 


e rtuti? y 
been discharged from penal institut! e h 
this interim 


munity has 

with similar Problems, acid? 
© use of interim groups to “ tt 

transfer between ali pP a 

ties for exchange 

problems 


through catharsis. Ho 


es 
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number of members in the host group ap- 
proaches a substantial percent of the host 
group, a disintegrating effect may displace the 
ene assimilation effect (Anonymous, 

46). The determination of the optimum 


i 
o of new members to regular members 
amenable to empirical test. 
« 
cl 
Se shock” of rural migrants to urban- 
., vas studied by Dynes (1956). Using 
itude mea 
oes : : 
Cushion the shock if the migrant is from 
economic level. 
So bas re 
n P 
È Mills 1959) that the difficulty of 
lateq os &roup membership is directly re- 
tionshi relationship is explained in rela- 
isso to Festinger’s (1957) theory of 
t is assumed that there are un- 
as well as attractive aspects of 
. “Ergo, k 
tion Sone an unpleasant initiation his cogni- 
‘Asa, ee x : 
Bto oot experience in order to achieve 
Nes: 
Mattractive at some aspects of the group are 
q Pasi ‘mizing the initiation or over- 
e 


The use of sectarian groups to cushion the 
attit 3 
sures, it was found that the sect 
Ower socio, 
cently been demonstrated (Aron- 
achiey; 
; e pe: i a 
Ship, This Perceived value of the member 
. “Nance. 3 neS 5 
Ance, consonance in a decision-making 
Memb 
ership ; 

Under Pin any group. If a person has 
Unple at he has subjected himself to an 
n Lain. S a b 
Waren ership is dissonant with the 
ithe Ve. Consonance can be effected by 

lzi 5 
Ng the attraction of the group. As 


Sever; 
beco oo of i the initiation increases, it 
Limize a asingly difficult to ignore or 
ae ing ony © negative aspects of induction, 
alue pe y the alternative to exaggerate the 
isa © group. 

trast p Dothesized that in open groups, 
ip ;. © Closed groups, a change in mem- 


is > 
A Rot likely to assume undue impor- 
as a . need no 
Thy, “Aleut 


hot — needed change in leadership 
t or p © as embarrassing to either the 
© Broup under conditions where 
eg, CUP. membership are regular 
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Assimilation of the Newcomer in the Group 


Various aspects of the process of the 
assimilation of the new member were dis- 
cussed under the topics of “Power” and 
“Strategy Effect in Personnel Changes.” Here 
the assimilation process will be considered in 
detail. 

Assimilation is defined here as the process 
by which a newcomer establishes a positive 
affective identification with the host group; 
and the host group, in turn, comes to perceive 
the new member as an integral part of the 
social unit. For more than forty years the 
theoretical approaches to the newcomer phe- 
nomenon have been based largely on the 
writings of Simmel (Wolff, 1950). In general, 
Simmel described the advantages of a stranger 
in a comparatively stable society.- The 
stranger’s or “potential wanderer’s” advan- 
tages were said to derive from the following 
characteristics of his position in the group: 
mobility, objectivity, confidance, freedom from 
convention, and abstract relations. With re- 
gard to mobility it was said that the stranger 
may interact with the members of the group 
but is not intrinsically related to any member 
through kinship, locality, or occupation. 

Moreover, it is more probable that a 
stranger can adapt an attitude of objectivity 
with regard to group practices since he is 
not committed to the pressures toward group 
conformity. In support Simmel cited the 
practice of ancient Italian cities to appoint 
judges from outside their realm because 
endogenous citizens were not free from family 
entanglements and party interests. Related to 
the objective role is the observation that 
strangers (especially the stranger passant) 
often are trusted with confidences which are 
withheld from more closely related stable 
members of the group. 

In addition, the stranger is not constrained 
by convention or precedent. Finally, Simmel 
described the tendency to perceive strangers 
in more abstract terms. With the stranger, 
one has only certain general qualities in com- 
mon, whereas with more closely related group 
members our relationships are more detailed. 

In the United States, the social scientists’ 
concern with the phenomenon of the new- 
comer in the group may be traced at least to 
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the time of the formal recognition of Ameri- 
can sociology. The original justification for 
awarding academic status to sociology was 
the study of immigration, a problem with 
which no other social scientist dealt at that 
time (Shils, 1948). More recently, the mas- 
sive population shifts particularly to Israel, 
Australia, and the United States, and within 
the United States have stimulated renewed 
interest in the problem of the newcomer and 
the host group under conditions of changing 
group composition (Eisenstadt, 1951, 1952; 
Richardson, 1957a; Taft, 1957; Weinberg, 
1953). Within the United States alone, it 
has been estimated that one out of every five 
families moves each year (Rossi, 1955). 
The early immigration studies 
force on historical documents, ca 
demographic data, and informal ol 
The more recent studies tend to 
vey research techniques. Other investigations 
have employed criteria of assimilation such 
as vocabulary changes (Richardson, 1957) 
and objective measures of status (Eisenstadt, 
1951). Moreover, some of the programs have 
developed well-formulated theoretical frame- 
works from which, Presumably, experimental 
Programs will emanate (Taft, 1957). 


esults of these 
has been found 


relied per- 
se studies, 
bservations. 
rely on sur- 


ment, and youth [Ei 
(c) “ [There is] a 
between the gener: 
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characteristics. This is one of the oni 
tations of immigration research with E I 
to the more general problem of the a f 
tion of the newcomer in the group. Ns tudië 
ily this limitation of the immigration vei 
also restricts the scope of the ore 
structure based upon these studies. poitt 
The research discussed up to “piens é 
has concerned the microcosmic pra tly, 
immigration. However, more pee i 
search has become at once more m a 
and more generic; the newcomer phe? gall 
has been explored with reference tO milie" - 
social units and wider variety ° 
Studies were found concerning the (z jont 
tion of the foreign college student ) 
1952), the new member of the eae? ne 
Supreme Court (Snyder, 1958)s r et al 
recruit in an Army platoon (Stoute 
1949), and the new business leader ‘i 
sen, 1953). Several comparative $ g Allee 
animal behavior (King, 1954; Potter ail 
1950) and of laboratory experiments, ; 
Behringer, 1960; Ziller, Bebri”? 
Goodchilds, 1962) were also found. plems r 
he microcosmic approach to Pr? t el 
changing group composition enables pni "y 
Perimenter to employ small group tens o. 
which have been developed rather e ition | 
during the Jast fifteen years. Recogy mel 
the advantages of small group reseat cat 


s J k 
it! 

In a more recent experiment W' eh? ye | 

groups (Ziller & Behringer, 1961), De 

Similation Proc e 


g! that ng 
e 
es. The results suggested w g 
newcomer’s Popularity during aol wilt 
Period described a U-shaped curv® sih y 


ne in the early weeks- etd é 


Phenomenon Was observed with bo s! 
Fulbright Srantees visiting the Un! oo 
(Lysgaard, 1955), 


ch ace 
are” of” | 
he results of a program of res?" efet yf 


cerning the acceptance of a neW paf 
Stroup has been reported recently 
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(1958). In general, the results were derived 
Tom a response-response experimental design 
n which individuals in a new reference group 
be a or the Navy) completed question- 
ie ee individual characteristics on 
referene, and and acceptance of the new 
hole on the other. The following 
S were supported. 
on ai of a new group as a reference 
a) a hier ound to be associated with: 
at oro ency to perceive the members of 
tieram as exhibiting similarity to each 
| Sbjectives o the observer in tastes, goals, and 
support, i as offering each other mutual 
or Unique ) a tendency to perceive the group 
awareness cont (c) a conviction of clear 
tions- (d of the group’s processes and func- 
in the a” the host group’s relative position 
e Aurai Wvidual’s hierarchy of groups; (e) 
Ositive een of other reference groups; (f) 
era] fests Pectations of a group; (g) the gen- 
th ena to associate with groups; (A4) 
With grou a length of anticipated contact 
Eenera] Sa hp Positive rather than negative 
Newco lal response patterns. o 
Study mb (1961) conducted a longitudinal 
ticulargy the acquaintance process and, par- 
llega? Mutual attraction among 17 male 
teed te Wcomers and strangers who had 
Underi live together for four months. 
theory 708 the study was a system balance 
rameworye T on Heider’s (1958) interpersonal 
Ha hus it was proposed that a 
an interpersonal system develops 
other a Persons who are attracted to each 
atttaction et in agreement with regard to the 
sy Mon pr a third person. One of the 
a ‘ee of achieving balance in the 
a t change attraction in the direc- 
wee ent Se with whom one was in closer 
rela fons egos regard to attitudes and 
( ay ted Š Pae Some of the conclusions 
attip Breemer r series of studies include: 
or tug es dia concerning relatively stabile 
Dren a Slane the acquaintance process be- 
} ences” nae determinant of attraction 
> (b) Nonauthoritarians tend to 
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tion units of larger size (high-attraction pairs 
tend to build up into high-attraction triads, 
tetrads, and even larger units). (d) Inter- 
personal relationships of attraction stabilize 
at relatively early stages of development as 
individuals cease to acquire new information 
about each other. 

Again, it is noted that the results of the 
preceding series of studies concern exclusively 
the characteristics of the newcomer to the 
group, since the characteristics of the host 
group are not varied under this type of de- 
sign. Indeed, in Newcomb’s (1961) study 
the newcomer’s and the host group's charac- 
teristics were confounded. A more systematic 
and inclusive analysis of the newcomer phe- 
nomenon is demanded. 

As a preliminary step in the analysis of the 
newcomer phenomenon, a framework of the 
salient components in the assimilation proc- 
ess may facilitate more comprehensive experi- 
mental inquiries. The components of the 
phenomenon appear to be: (a) the character- 
istics of the host group, (b) the characteristics 
of the donor group, (c) the characteristics of 
the newcomer, (d) the relationship between 
the host and donor group, (e) introduction 
and orientation procedures involving the new- 
comer, and (f) the bases for changes in group 
composition. 


Host Group 


In analyzing the various components of the 
assimilation process, we will have recourse to 
a decision-making model in which the rewards 
and cost relative to the newcomer and the 
host group are evaluated in juxtaposition. 
With reference to the personality of the 
host group, it is broadly hypothesized that 
sharply defined group boundaries tend to 
increase the cost of assimilation of new 
members both to the member and to the 
host group. 

The hypothesis is almost trivial for its 
simplicity. The hypothesis simply holds that 
a broadly defined category can embrace a 
great many more objects than a narrowly 
defined category, and the cost to the group 
of establishing the criteriability of the object 
is diminished, thereby reducing the difficulty 
of assimilation both for the object and the 
perceiver (local group). A wide category host 


178 


group, by necessity, places the onus of defini- 
tion of role on the new member rather than 
on the system. Since selection science is rather 
inexact, this provides a degree of tolerance 
for the system which should facilitate inte- 
gration. Moreover, by broadly defining the 
group, a set for tolerance rather than a set 
for criticism is created. The newcomer is not 
surveyed with a jaundiced eye but rather 
with a positive attitude toward idiosyncracies, 

A wide-category group as defined here may 
be created in several ways. Group size js 
associated with the Permeability of group 
boundaries and the ope 
(simply on a probability basis; e.g., the 


itself. Recently, Pettigr 
the measurement of a“ 
able; subjects tended to 
medium, or narrow in 

ranges of the phenomen 
Although the interperso 
of the instrument have 
date, it may still be hypothesized th 
composed of members with wide cate 
indices as opposed to those with 1 
will tend to assimilate a new me 
readily. Similarly, a newcomer 

category width tendencies may a 


at groups 
gory width 
ow indices 
mber more 
with wide 
lso be ey- 
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pected to accept membership in a new 8" ni 

more rapidly and thoroughly. opost 
At a more general level, it may be Pr it 

that highly differentiated or complex jue! 

and newcomers facilitate assimilation A fact! 

the increased probability of matching 

of the individual and the group. 


Donor Group 


«tics if 

The relationship of the characteris i 
the donor group to those of the hos he pro 
a second component in the assimilato? ©, 
ess. Again, quasi-statistical mea t gre 
invoked to support the hypothesis t g i 
similarity between the host and ee. dit 
tends to result in reduced assimilati p 
culties. Again, if the host and dont ie p 
are similar with regard to culture, 8 2. g 


<. qifficulties 
sonnel, and structure, transition diffic 


a 
pati p 


o 
of the new college student. b; a 
Wentholdt (1956) in studying = Zenl 
ment of Dutch immigrants to Nev eadi" 
Suggests that successful personality „, soci 
ment of an immigrant to a “hos ani "i 
varies with the degree of self-reorg egr? gl 
required by that society and the mit” 


Preparedness on the part of the race É 
to have such n ; 


reorganization take P 

Still, the dune of the donor arora 
to retard the growth of an integra t 
ship between the newcomer od 0 gi 
group. In Christie’s (1954) stu A it 
transition from civilian to Army watt fh 
that greater distance O, frie! 
d contact with family 2” wal 


Concluded 
and limite 
are more 


life during the basic training ph@ 


imi oh 
ley’s (1958) studies covering the 3 
eye S 
Peditious use of } s Og 
assimilation proce, ro) 
United States a fs y 
cademy is discussed by Dornbus pol w 
used Miller an@ yz o 
(1941) social learning theory to ania! | 
een reward an one ‘ j 
membership 12 i ne 
subsequent rect" 


other social movements. It was hypothesized 
that “the greater the punishment which led 
to defection, the less similar to the original 
> eae is the new movement most likely 
in e chosen. Perhaps successive partners 

Courtship would be somewhat more 


a > h 
menable to experimentation than social 
movements, 


The Newcomer 


rin major component of the assimila- 
earlier fee is the newcomer. As indicated 
| compere on of the ease of study, this 
analysis : phe been subjected to detailed 
952) amet the studies by Eisenstadt 
"eDeating th Tartley (1958) are cited. Without 
Suggest he findings in detail, these studies 
Sonality € existence of a cosmopolitan per- 
ables 4’, 2 Complex of behaviors which en- 
© Individual to adjust to a variety of 


A Social . . ., . 
vitonen tS of changing social 


er 
Politan n° (1949) analysis of the “cosmo- 


here, The Posed to the “local” is pertinent 
of ey Cosmopolitans had lived in a variety 
The ae and were newcomers to the town. 
Seldo, cals were usually natives and had 
The cost away except for brief periods. 
Man Smopolitans were not members of as 
Dor groups as the locals. They stressed the 
"ther i of the qualities of their friends 
Moreover the number of people they knew. 
Mganizati where the locals tended to join 
theip ta lons for the purpose of extending 
tende oe „Of contacts, the cosmopolitans 
Skills „2 Join organizations in which special 
Tt is <nowledge were important. 

I m miae conceivable, however, that the 
ip, ole stem havior of these contrasting 
y Charact S from some underlying personal- 
R the sap) again related to complexity 
ta tioned “Concept. In terms of the afore- 
i be a sa lles the cosmopolitan appears 
to Sentimente p ile is not grossly nostalgic 
w t forma, 1S liberal; in perception and 
cate ca a n characteristically employs 
Vie Stes an Width and a large number of 
ts the wy er than dichotomous divisions; 
he is orld positively; tends to be a risk 

E cog ot Cynical; and is not prejudiced. 
Politan in contrast to the local, is 
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more highly transferable and his abilities are 
more highly transferable. 

Transferability implies, as a very base, a 
positive attitude toward change, or at least, 
a lower resistance to change. In addition, 
since a variety of persons may be encoun- 
tered who may be arranged along a normal 
curve on a number of dimensions, the proba- 
bility of adjustment in new groups is probably 
increased if the mobile individual is located 
near the mean of the population on a number 
of these same dimensions. Thus, if the normal 
population tends to be only moderately preju- 
diced toward Negroes, for example, the highly 
prejudiced or highly pro-Negro newcomer 
will tend to have more difficulty adjusting to 
and gaining acceptance in a number of groups. 
This may suggest that individuals who suc- 
cessfully adjust to a variety of new situations 
tend to be conformists in relation to the be- 
havior of the greatest number of persons in 
the population in which they are likely to 
move; just as the locally oriented individual is 
a conformist in relation to the behavior of the 
greatest number of persons in his immediate 
environment. 

The discussion concerning the assimilation 
of the newcomer in the group has indicated 
that the change involved is more or less dis- 
turbing to either the group or the individual 
depending upon the characteristics of the 
group or the individual. It is now suggested 
that a change in group membership requires a 
reevaluation of the host group or the self. 
Under conditions of group stability less ac- 
ceptable characteristics of the individual or 
of the group may be dissociated; the indi- 
vidual and the group through collusion may 
avoid confrontation of issues which are known 
to be disturbing. The introduction of a 
change, however, and a change of personnel 
in particular, forces a confrontation with the 
self. The newcomer mirrors the group to it- 
self; and the group, in turn, mirrors the 
individual to himself. The confrontation is 
inherent in the initial assimilation process. 
Exposure must precede acceptance or adjust- 
ment in whatever form it takes. Thus, the 
individual or group experiencing difficulty in 
self-acceptance will have difficulty adjusting 
to changes in group membership; unresolved 
but compartmentalized or dissociated prob- 
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lems are reflected in interaction with the unin- 
formed and unprepared other. 

Continuous confrontation of the self either 
by the group or by the individual is presumed 
to generate countercathexis and an inward 
orientation which may detract from the gen- 
eral effectiveness of the organism to cope with 
day to day problems. Thus, a press is posited 
whereby both the stranger and the host group 
act to introject the other in order to avoid 
protracted mirroring or self-confrontation, 


Leadership Changes 


A special case of changes in group member- 
ship concerns changes in leadership or the 
group power structure. While there has been 
some speculation concerning the problem, sur- 
prisingly little research has been reported. 
Organizations are reluctant to permi 


through internal Promotions, 
Several field studies 
concerning leadership successi 
reported a 4%-5% successi 
dents of a random sample of gs 
brewing, and automotive j 
of succession among sc 
Carlson (1961) noted 
internal when employers wi 
system; but that outsid 
signed for innovation, 
Although the data an 


€ d conclusion are un- 
comfortably disparate, 


Grusky (1961) sug- 
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gests that bureaucracies require periodi, a 
cession at the top if they are to adap i 
quately to their environment, since al 
only means of infusing a degree of my 
which bureaucracies especially see. 
also suggests, however, that bureaucrat! ji 
only require more, but also can a i 
succession at the top levels because t strut 
tion is well defined in terms of gone id 
ture, power, and communication ciann rel? 
Laboratory experiments have eet 96 
tively few. Pryer, Flint, and pas Jeade’ 
found that successful groups change 
less often. ‘on und? 
Consideration of leadership noone ri 
various conditions of group ia the 
conditions of success and failure sar, go 
organizations in which responsibility o p” 
action is shared by the members ten of Y 
petuate the attitudes and outcomes rshi’ 
group following a change in formal lex js co 
whereas organizations in which poWe joi 
tralized 


por i 
leadership Succession. Thus, under ss H 
of 


with decentralized power may be exP ot 
co! 0 
zations with centralized power unde 
tions of leadership change are ar pria! 
Aa the past and improve sed w 
bility of Success. Thus, it is proPO? to io 
democratic Organizations are design’ gal 
Petuate success, whereas autocratic 


tions are designed to adjust to failure 
n 
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PSYCHOLOGICAL EFFECT OF EFFORT’ 


MICHAEL LEWIS 
Fels Research Institute, Yellow Springs, Ohio 


The present review explores the psychological effect of effort. While the 


present problem has received little 
divided into two major areas: (a) 
avoided in a choice situation and resu 
(b) Can effort once expended affect 


attention, the studies of effort can be 
Is effort an aversive stimulus, to be 
ting in decrements in response strength? 
the stimulus-event (reward) associated 


with that effort? The available literature does not provide any conclusive 
statements concerning the aversive effect of effort. However, effort does seem 
to enhance the value of a stimulus associated with the expenditure of effort. 


aa Problem of effort and its psychological 
ue S has received little attention. This was 
Baye ely to the belief, stemming from 
sive hia theory that work was an aver- 
and Stimulus to be avoided given the choice, 
Stren i it resulted in decrements in response 
a When applied. Recent investigation 
indicat oth assumptions to be in question and 
al cs that effort may even enhance the 
aa of stimuli associated with it. 
fot a purpose of this review, the litera- 
two m „ĉffort or work will be divided into 
an ag issues: (a) What are the effects 
the ne Organism’s expenditure of effort upon 
fortit effortful response? (b) Once an ef- 
Xo lon response has been made and effort is 
erortea Present, what are the effects of that 
u! response? 

Sumed under (a) is the question: Does 
Next gange the probability of making the 
be fortful response? This response could 
. “dice response or the continuation of 


Changes response. To test whether effort 
ano o the probability of choosing one or 
"eSDonse response, investigators made one 
Other, (choice) more difficult than the 
to a°, OT example, long versus short paths 


barrier p jumping or not jumping over a 
Dughi, ù Order to reach a goal, and, finally, 
a ight or heavy doors in order to 
ro Ea. To test whether effort changed 
onse bility of the continuation of a re- 
> MNvestigators tested how long an or- 


iT 
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ganism would continue to respond (with no 
reinforcement) when the amount of effort it 
needed to expend was varied. These extinc- 
tion types of experiments usually used Skinner 
boxes and differentially weighted levers. 

In neither the choice nor the performance 
experiments were the experimenters interested 
in the effect of effort once effort kad been 
expended and was no longer expended. Essen- 
tially, these experiments were interested in 
the question of whether the expenditure of 
effort was aversive. However, there were 
experimenters who were interested in the ef- 
fect of effort after effort had been expended. 
These investigators were involved with the 
question of the effects of effort on the stimu- 
lus event (primary or secondary reward) 
associated with the expenditure of effort. 

The following discussion will therefore be 
divided into two main areas: the effect of 
effort on a response, and the effect of effort, 
once expended, on the value of a stimulus 
event associated with the effortful response. 
The effect of effort on a response was 
measured in two ways: (a) preference (by 
choice) of the more difficult response over 
the easier one, and (b) the strength of the 
instrumental response. In studying the effect 
of effort on value of a stimulus event 
(primary or secondary reward), two measures 
of value were used: (a) choice; that is, 
choice of the stimulus event associated with 
high effort over low effort, and (b) the 
strength of the instrumental response in ob- 
taining the stimulus event associated with 
high effort as opposed to low effort. 
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EFFECT or EFFORT ON A RESPONSE 
Choice of Effortjul Responses 


The concept of effort was formally intro- 
duced by Hull as the Psychological variable 
of work in his Principles of Behavior (1943). 
The notion of work or effort is to be found, 
however, in earlier experiments such as those 
by DeCamp (1920), Gengerelli (1931), Kuo 
(1922), and Tsai (1932). One of Hull’s hy- 
Potheses concerning work was called the “least 
effort hypothesis” which stated: 


y learn to choose 
| r Sequence leading to the 
einforcing state of affairs (Hull, 


1943, p. 294]. 
In a later work (Hull, 


1952), the hypothesis 
was stated as follows: 


Other things being equal, organisms 


E hs in detouring about 


fewest turns, 
) used a symmetri- 


a heavier one 
o found that 
er low rather 
ain equivalent 
ulloch (1934) 
the lighter of 
S results were 
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A 
inconclusive in that the animals develope 
Position effect which made it impo o 
ascertain the effects of the pec ae same 
In a recent study by Chung (1965) ç welt 
Problem was again studied. Pigeons ing 
trained to depress two levers by M è 
each requiring 25 grams of force. d to de 
training procedure the force rao so thal 
Press one of the levers was change he othe 
one key was easier to depress than on oF 
Chung found that the rate of TERR SO 
one lever was independent of the $ 
force on the other lever, In age 
effortfulness was not an aversive $ 
and therefore the animals showed nO 5 
ence for the least effortful cage ate the 
unlike the other experiments on oa Jes 
animals showed no preference for thi? 


. 1 0: 
effortful response. With the excee oi onfi” 
result, the data on choice appear t 
Hull’s viewpoint, 

Strength of Instrumental Responses ib 


0 

Hull (1943) also viewed work in termi hal 
subtractive effect on performance on i rel 
it in several of his laws, particular!) hi tion 
tion to reactive and conditioned punct" 
He stated that “the net amount of ro yy 
ing inhibitory potential resulting ijy” 
Sequence of reaction evocations is & k 3 
accelerated function of the amount resp?" 
involved in the performance of the peliet" 
in question [p, 279].” Hull further a 4 
that this inhibitory potential act ti pe 
“Drimary negative motivation in thao? p 
the innate Capacity to produce a cess 219 


ae p 4 
the activity which Caused the state hive pe 
One of the differences between rea 
conditioned 


ure ijs WAS tes 
t inhibitory potentials st? 
duration of 


+ ationa 
the negative motivatio 5, 
Extinction q t : j Sat 


ih 
o 
Por! fe 
involving greater ish "je 
would presumably extingul?. g 


r rit? 
quickly than those responses requ! 
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effort, Crutchfield (1939) trained rats to pull 
differentially weighted food cups in order to 
obtain food and found results that did not 
agree with Hull’s expectation. During extinc- 
tion he found generally that those animals 
pulling the heavier weights tended to extin- 
Suish more slowly than those animals pulling 
lighter weights, 
Amcor and Jones (1943) attempted to 
ae ligate the relationship between effort and 
; ance to extinction by training a group 
can to press down on a lever in order to 
‘eke a food pellet. The training procedure 
th eaditioned all the animals to press down 
‘i grams of pressure and then gradually 
creased the pressure required to 80 grams. 
son animals were then divided into three 
i S based on the amount of effort needed 
€press the bar during extinction: 5, 42.5, 
&rams. The results indicated a perfect 
ve relationship between effort during 
ree tion and resistance to extinction and 
@ppeared to support Hull’s hypothesis. 
since Nie was difficult to interpret 
trials e animals received 180 reinforced 
compare. the light bar during training, as 
avy aa to only 80 reinforced trials on the 
explain m The results could therefore be 
quene e on the basis of differential _fre- 
rather. of reinforcement during acquisition 
tespo than on the basis of effortfulness of 
nse during extinction. 
of ne rabies (1951) controlled for frequency 
five S Orcement during acquisition. He used 
apres each trained with a different 
Stams Sing weight: 10, 20, 30, 40, and 50 
animals ach acquisition group contained five 
Was exti and was divided so that one animal 
bar-pr Mguished on each of the five different 
agr essing weights. The results generally 
Bteater i those of Mowrer and Jones—the 
t ar © amount of effort needed to press 
during extinction, the more quickly 
Tesponse extinguish, even among 
Who experienced a single common 
effort during acquisition. The results 
‘cated, however, that animals both 
bar to, and extinguished on the more effortful 
Onger to extinguish than those 
bar, and extinguished on the least effortful 
Order to explain this last finding, 
'S argued that the feedback stimula- 


Negati 
Extince 


of 
ing 


Zwe 


Lao 
tion from the bar-pressing response was 
analogous to Hulls V construct (stimulus 


intensity dynamism), and that it was greater 
under conditions of high rather than low 
effort. He further stated that response 
strength should be greater in the high-effort 
group than in the low-effort group because 
of V’s multiplicative relationship to other 
variables: sEr = Í (sHr X D XV). 

Maatsch, Adelman, and Denny (1954), 
using a larger number of subjects under the 
same experimental conditions as those of 
Mowrer and Jones, tested for the same effect. 
Using the time required to obtain the last 
10 rewards of the training period as a meas- 
ure of habit strength, the authors found no 
difference among the three groups. The 
results of their extinction data also indicated 
no significant differences among the groups. 

Capehart, Viney, and Hulicka (1958) also 
controlled for frequency of reinforcement 
during acquisition. These authors used 5-, 
40-, and 70-gram bar weights and found that 
the number of responses emitted during ex- 
tinction varied inversely with the effort- 
fulness of the response. Quartermain (1965) 
repeated the Capehart et al. experiment but 
was unable to find any difference in the 
number of responses during extinction as a 
function of the effort required in depressing 
the bar. 

Aiken (1957) had rats push open differ- 
entially weighted doors in order to get food 
and found no significant difference between 
groups during extinction. Solomon (1948) 
found that animals extinguished quicker when 
forced to jump 16 rather than 8 inches from 
one stand to another in order to receive food. 
It could be argued, however, that competing 
responses induced by the longer jump rather 
than differential effort were responsible for 
the differences in extinction between the 
groups. A jump of 16 inches might be more 
frightening than a jump of 8 inches, thus 
creating differential anxiety. In that the 16- 
inch jump associated with greater anxiety, 
one would expect more rapid extinction, 

Finally, the studies of Lawrence and Fest- 
inger (1962) are relevant to the discussion of 
effort and rate of extinction. The authors 
found that when the slope of a straight alley 
maze was varied, animals who were required 
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to run up a steep 45-degree slope (to exert 
greater effort) extinguished more slowly than 
animals who were required to run along a 
level runway. In another experiment, Law- 
rence and Festinger demonstrated that in- 
creased effort during partial reward intensi- 
fied the effect of unrewarded experiences on 


Thus, because of inconsistent results, it can- 


decreases the 
strength of an instumental response, 


The previous experiments had little bearing 
on the value of the goal or of the reinforcer 
itself, They related primarily to the choice 
or strength of instrumental respo; 


or secondary reward 
choice or by the stre: 


and used thi 
ures of the valence of the object, His ex- 
periment may be interpreted i 

that anticipated effort enhances 
or incentive value of a goal. 
indicate, however, the effect of a = 
ing the effortful response. Child and Adelshiem 
(1944) repeated Wright’s experiment and re- 
ported contradictory findings, Their subjects 
preferred the more easily attainable goals, In 
a subsequent study, Child (1946) attempted 
to demonstrate that the Preference for a goal 
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more difficult to attain was the esl 
several factors, the most important a be: 
Was previous learning or socialization dif 
respect to differential reinforcement E. to 
ferential effort, such that the child par- 
value the stimulus properties of effort si 
He introduced variables such as ape ol 
and showed that the preference 4 boys 
distant goal increased with age, and ye 
were more likely to prefer it than erent 
attributed these phenomena to aip ne i 
degrees of learning of a value ae er! 
which effortful goals were sometimes P enets 
because of previous reinforcing consa u 
rather than to a direct effect of effor 
valence, 
Festinger’s (1957, 1961) theory he pro” 
nitive dissonance is also relevant to t e el 
lem of effort and value. Festinger en 1 
cognitive dissonance as occurring rt be 
Cognitive elements are not in agreemen tivated 
dissonance occurs, the organism i mo e% 
to reduce the dissonance by changing gp” 
both elements until they agree. In “thos 
by Festinger and Aronson (1960), the je va 
Specifically discussed the theory’s ia w 
to effort and value, They contended & ev ard 
an organism obtains an “insufficient 3 
relative to a Particular expenditure of & the 


there will be a tendency to discont ont! 
effortful act! 


p” 


of cog 


ivity or to attribute 2 


value to the activity or to its ma r 
quences, This development of an “esT cos 
erence” for the activity or its 8° 


fro at 
on; tp 


i alences is conceived as resulting | te 
psit? 


internal Proce: 
is, the Teceip 
with the eff 
leads to a State of cognitive disson4 
the Organism js 

study by 4, 
directly asse: 
warded trial 


SS of dissonance resolut t 
t of a reward incomm™ ion, 
ort expended in its ae o 
jp 
Motivated to reduce, by, 
ronson (1961), preferen wr 
Ssed following a series a“ 
S on tasks which requite 


at er group. The high-effort group >, Jory 
ver that shown by á ond” 
rrelevant aspects (5 ý 


reinforcement) of the goal situation- p” ) 


j 

: pÉ 

3 (1953) useq children in 4? apt 

meng attempting to show that “y pi” y 
Practice” 


c 
affects the value of a 5° 
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reinforcer, Practice in “wanting” was varied 
by using a task in which differences in ex- 
pended effort could be introduced. This was 
done by varying the number of turns of a 
crank necessary to attain a reinforcer. In the 
habituation phase of the experiment, Olds 
on the children to turn a crank three 
; es In order to receive a chip. They did this 
pr 20 days, receiving 8 chips a day. At 
dima | of this period the subjects were 
ed Into four groups. Group A now had to 
> the crank 12 times to get a chip; Group 
ck times; Group C, 3 times; and Group D, 
children Using the pressure exerted by the 
i en in turning the crank as the measure 
ol AF secondary reward value of the chips, 
chips ound that Groups A and B valued the 
ine than Groups C and D. While the 
effort mA be interpreted as indicating that 
“a Irectly affects the value of a reinforcer, 
that aca also be interpreted as indicating 
children chip value is constant, but that the 
Rteater in Groups A and B turned with 
Pressure simply because they experi- 


ence : 
fr d an Mcrement of drive attributable to 
“tration 


bee ee recent and related experiments have 
Whet aes 4 concerned with the problem of 
in valu a stimulus event undergoes changes 
during 2S 2 function of the effort expended 
the response it reinforced. In one 

Btade te (Lewis, 1964a), first- and sixth- 
Or low S Were subjected to either a high- 
Poker ott Procedure in order to obtain 20 
Lumber Ips. Effort was varied by altering the 
to o _of turns of a crank handle necessary 
Of val re a single chip. Two kinds of indices 
assessed Were used: (a) preference value was 
tern, iv „determining the number of al- 
trainin © chips (differing in color from the 
the chip) the child would exchange for 
Valug iting chips; and (b) reinforcement 
Chip on he effect of presentation of a training 
€ probability of the occurrence of a 

„Dew response—was assessed in a 

seq choice problem where the chips were 


as : 
Whi the reinforcers for a correct response. 
© the 


fiec Preference value failed to show the 
the effort, the reinforcement value of 
With pro Was affected so that those associated 
Že effort were more effective in in- 


€ Probability of the occurrence of 
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the desired response than were those as- 
sociated with low effort. 

In another study using animals as subjects, 
Lewis (1964b) again tested whether the re- 
ward value of an object is functionally related 
to the amount of effort required to obtain 
that object. Three experiments were con- 
ducted where rats were required to pull 
weights in order to obtain a minimal amount 
of a relatively unique food reward. Four 
weight groups were used: 5, 30, 55, and 80 
grams. The weight that the subject pulled 
was constant from trial to trial. After the 
training periods the weights were removed 
and not used during the testing procedure. 
During the test phase of the experiments, 
value was assessed in three ways: the running 
speed to the food in the straight alley maze, 
the speed with which the animal consumed 
the food, and the amount of the food con- 
sumed in a free-feeding situation when the 
animal was satiated on another food. The 
results of all three experiments supported the 
hypothesis that animals expending the greatest 
effort in a training series develop the strongest 
preference for the stimulus event (food). 

In a final experiment, Lewis (1965) tested 
the hypothesis that a secondary reinforcer 
associated with high effort would be preferred 
to the same secondary reinforcer associated 
with low effort. Preference was measured in 
two ways: (a) subject’s choice in a T maze, 
and (b) time spent in the presence of the 
secondary reinforcer. Each animal experienced 
both high- and low-effort conditions associated 
with either a vertically or a horizontally 
striped area. Half of the subjects experienced 
high effort with vertical and low effort with 
horizontal stripes. The other half experienced 
high effort with horizontal and low effort with 
vertical stripes. During the test phase, the 
animal, without weights, was given a choice 
between the striped areas and the total time 
spent in each arm was recorded. There was 
no reinforcement given during the test trials. 
The results demonstrated that effort is effec- 
tive in influencing the value of a secondary 
reinforcer. Thus, the effect of effort on value 
has been clearly seen in three experiments, 
although its effect was mild. Festinger (1961) 
in discussing the effect of effort, also reached 
the conclusion that for the rat, the effort 
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variable has a weak influence on the shift in 
the value of a stimulus event. 


THEORETICAL ISSUES 


Existing within the body of psychological 
literature are several theoretical positions 
which can account for the fact that effort 
can affect the value of a stimulus event used 
as a reward for the effortful response, as well 
as explain the nondecremental effect of effort 


on a response. Four such positions are given 
below. 


Past Experience 


The history of the organism, either animal 
or human, may be such that responses involy- 
ing differential effort have been associated 
with differential reinforceme 
effort may have been fol 
reinforcement. This relation 
and reward may 
where effort may 
reward, For hum 
man, this has 


a 7 “Protestant 
ed in Cognitions such as, “T 
» it is more 
Festinger’s 


worked harder 
valuable.” Thi 


It c l gued that proprioceptive 
stimuli associated with high effort are more 


intense or perceptually articulate (i.e., have 
greater “Sp” value), and that their discrimi- 
native properties enhance their derived (sec- 
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ondary) value for any given amount of ye | 
nal external reinforcement. After training W 
differential effort, in any subsequent ee, 
the value of a reinforcer in which a respo™ 


n = i: J T: i ust t 
at all similar to the training response 1S 


« oreet 
are 5 cter reinfort 
the apparent value of the external d under 


ibuted 


might seem greater for a group See 
high effort. But this effect could be ar efi 
to a generalization effect, with the high" 


izal 
e . fe neraliz ‘ 
conditions producing a greater ge nditio®: 


not 1 
ather £ 


of response than the low-effort CO 
Thus, the superior performance may 
flect an increased preference, but 1 atter 
differential generalization of response Paba 
based on differential proprioceptive fee mb) 
Such a problem occurred when Lewis raini? 
used a test response similar to the e 
response. By using choice as a test met i 
a response completely different iona 
training response, Lewis (1965) e { 
this interpretive problem. 


Frustration pet 
= x 

A third problem in interpreting prow 
ments on the effect of effort revolves “ight 
the concept of frustration. High effo! rect! 
be viewed as interfering with a g04 net 
response, resulting in frustration and W i 
ing the general drive level of the z. Sent! 
(Brown & Farber, 1951; Hull, 1957 est 
Maccoby, & Levin, 1957), This intetP jotai 
of effort in terms of frustration ™” fiet 
that the effort itself did not directly ra 
the value of the external reinforce", que” 
that it affected drive level and the oe " 
reinforcement value of drive reduction” opie 


The same kind of interpretative a wit 
exists where frustration is defined (Amer: 
drawal or delay of expected rewar y 195 


1958; Holder, Marx Holder, & collie pati 
Wagner, 1959). Qld’, (1953) work © gon 
ularly open to this kind of interp"? sto 
that the failure to receive the acen 
reward could haye led to frustra 


therefore to increased drive. 
Competing Responses 


The last problem in interpreting A e 
of effort is the attentional facto" end 
(1964a, 1964b, 1965) found that effort y "i, 
to hamper any movement or acti” fn | 
directed toward obtaining the goal. 


tio? 


t 
el 
ef j 
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animal experiments, high-effort animals moved 
directly toward the goal area, while low-effort 
animals jumped about and explored the 
Straight-alley maze. The high-effort as com- 
pared to the low-effort children also seemed 
to be more involved in obtaining the chips, 
that is, in looking at the dispenser and in 
turning the crank, rather than looking around 
the experimental room. One might view effort, 
then, as inhibiting those responses that are 
are patible with obtaining the stimulus 
iret thereby increasing the probability of the 
Whit rence of the goal-directed response. 
efort this explanation can best account for 
i differences in a test situation having a 

Ponse similar to the training response (see 


ewis, 1964b), it might also be applicable to 
other situations, 


CONCLUSION 


Ter preceding discussion makes it clear 
any © available literature does not provide 
eliet usive statements concerning the 
MeS effort, However, the data do lend 
an or SUPport to Hull’s (1943) view that 
Á ganism will choose the least effortful of 
responses, The evidence for Hull’s belief 
eon has a subtractive effect on per- 
Vatiets S inconclusive. More data, in a 
hi of experimental settings, is needed 
re this issue is settled. 
chain lly, the data on the effect of the ex- 
even, XE Of effort on the value of a stimulus 
Value associated with effort indicates that 
for ee be enhanced by effort. The reasons 
further 5 effect are not clear, necessitating 
Investigation. 
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THE ROLE OF THE FAMILY IN THE DEVELOPMENT 
OF PSYCHOPATHOLOGY 


GEORGE H. FRANK 


University of Miami 


Psychologists generally make the assumption that the experiences to which the 
individual is exposed over a period of time lead to the development of learned 
patterns of behavior. From this, psychologists have reasoned that the experi- 
ences the individual has in his early life at home, with his family, in general, 
and his mother, in particular, are major determinants in the learning of the 
constellation of behaviors subsumed under the rubric, personality, and in 
Particular, the development of psychopathology. A review of the research of 
the past 40 yr. failed to support this assumption. No factors were found 
in the parent-child interaction of schizophrenics, neurotics, or those with 
behavior disorders which could be identified as unique to them or which could 
distinguish one group from the other, or any of the groups from the families 


of the controls. 


a Psychopathology came to be viewed as 
o consequence of the emotional experiences 
= ich the individual was exposed, interest 

ences Sed on the earliest of such experi- 
ee those that occur in the family. The 

te infant is born incapable of sustaining 
ollown life for a considerable length of time 
ena 28 birth, and is, in consequence, de- 
stitute nt upon the mother or a mother sub- 
Wonde, for its very existence. There is no 
tionshi: A therefore, that the mother-child rela- 
infue D JS @ close one and is expected to be 
s ritial with regard to the psychological 

for eoment of the child. Some explanations 
there € development of psychopathology have 
Ship Ste focused on this particular relation- 
(193). the major etiological factor. Levy 
Patter; 1932, 1937, 1943) has described a 
tection. „centering around “maternal overpro- 
tudes 2 „involving a constellation of atti- 
Pme; Which he felt contributed to the devel- 
(1938 of neurotic disorders, and Despert 
‘elation focused on a kind of mother-child 
Soca which seemed to her to be closely 
Dhren; ed with the development of schizo- 
termed” a pattern which has come to be 

the “schizophrenogenic mother.” 

5: ` hypothesis that the emotional climate 
famiļy_ > terpersonal relationships within the 
} Sonne between the child and its mother 
develo icular—has a decisive part in the 
Woujg ment of the personality of the child 
Seem to have face validity. In part, 


support for this hypothesis may be gleaned 
from the data demonstrating the devas- 
tating effects of being brought up in the 
extreme interpersonal isolation that comes 
from not having a family (Beres & Obers, 
1950; Brodbeck & Irwin, 1946; Goldfarb, 
1943a, 1943b, 1943c, 1945a, 1945b; Lowrey, 
1940; Spitz, 1945) or extreme social isolation 
within a family (Bartmeier, 1952; Davis, 
1940). Moreover, it has been demonstrated 
that various specific emotional behaviors of 
the child seem to be correlated causally with 
factors in the home. For example, children 
who could be described as emotionally im- 
mature, who are dependent, fearful, nega- 
tivistic, emotionally labile, etc., have had 
mothers described as worriers (Pearson, 
1931), overattentive (Hattwick, 1936; Hatt- 
wick & Stowell, 1936), or punitive (McCord, 
McCord, & Howard, 1961; Sears, Whiting, 
Nowlis, & Sears, 1953; Watson, 1934). Chil- 
dren who were described as being overly ag- 
gressive were described as having come from 
homes where mothers were seen as overcon- 
trolling (Bishop, 1951) or punitive (Mc- 
Cord et al., 1961; Sears, 1961). 

The evidence thus far suggests that there 
is, in fact, a correlation between events in the 
parent-child relationship and resultant per- 
sonality traits. The question arises as to 
whether there is evidence which supports the 
hypothesis that there is a correlation between 
events in the parent-child relationship and 
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the resultant complex patterns of behavior 
which have been termed personality. More 
specifically, in light of the theories which 
relate personality development to social, (i.e., 
interpersonal) learning, the question is raised 
as to whether there is any consistent rela- 
tionship between the emotional experience 
the child may have in the home and the de- 
velopment of personality pathology, that is, 
schizophrenia, neurosis, and behavior dis- 
orders. Towards this end, the findings of the 
research that has explored the psychological 
characteristics of the parents of these people 
will be analyzed in order to isolate those con- 
sistent characteristics of the parents that 
may emerge from study to study. The analy- 
sis will be done with regard to each major 
type of psychopathology as a group. More- 
over, because psychological test data might 
yield different information than case history 
analysis, or direct observation of familial 
interaction as compared to attitudes as elic- 
ited by questionnaire, an attempt will be 
made to analyze the information gleaned 
from the studies in terms of the method of 
data collection within the specific psycho- 
pathological groupings, 


SCHIZOPHRENIA 
Case History 
One of the classical 


! methods of = 
lection in the stud cae 


y of Psychiatric illness is 
the information for which 


these observations, 
In so doing, Despert (1938) observed that 
approximately 50% of the mothers of a sam- 
ple of schizophrenic children, generally þe- 
tween the ages of 7 and 13, had been aa: 
scribed as aggressive, overanxious, and over- 
solicitous and were considered to be the 
dominant parent. Clardy (1951) noted that 
50% of the 30 cases of children between 
the ages of 3 and 12 diagnosed as schizo- 
phrenic had families characterized as over- 
protective and yet basically rejecting. Frazee 
(1953) noted the presence of this constella- 
tion particularly when the families of schizo- 
phrenics were compared with the families of 
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children diagnosed as behavior disorder 
Canavan and Clark (1923) and Lampi 
(1933) noted that 30% of the children,” 
psychotics were themselves emotionally i 
turbed. Huschka (1941) and Lidz and e 
(1949) noted that over 40% of their a 
of schizophrenics had parents who were a 
chotic or neurotic. Bender (1936, pan af 
Frazee (1953) noted the high inci 5 oti 
psychopathology in the children of p° 
parents. Preston and Antin (1932); ené 
other hand, found no significant gitter 
in the incidence of psychosis and ee 
a function of parents who were psychos and 
compared to parents who were “normals 38) 
Fanning, Lehr, Sherwin, and Wilson, © 
found that 43% of the children gr o be 
who were psychotic were observet g] ad: 
making an adequate social and person mpl 
justment, with only 11% of that ° 
classified as maladjusted. 40% dj 
Lidz and Lidz (1949) found that 7 oe 
their sample of schizophrenic patie? ” sop 
deprived of one parent by divorce (1938) 
ration before they were 19. Plank chiz” 
found that 63% of his sample °% g w” 
phrenics had families where one Pate? sep” 
absent either due to death or marit? ‘nett 
ration. Wahl (1954, 1956) found thag jos 
was a greater incidence of pare?” peni” 
and rejection early in life for schiz0P 936) 
as compared to normals, and Batty `of if 
found that from the case histories 


ai 


rulers adjudged, post facto, insane by o 
them had lost one of their parents ry w 
time they were 18, However, ance 


a nc 
Bousfield (1937) found that the incl (12 
orphanhood in a psychiatric populate y w 
out of 26) was not much different i on 
incidence of orphanhood in a norm? ee 
lation (19 out of 24). Moreovels g vs 
McGarry, and Friedman (1952) oe at 
the incidence of broken homes and Pi re 
deprivation in the families of schi2P ons 
(34%) was not very different from ue 
found in the families of hospital ie 
(32%), alcoholics (31%), and Pph 


0 +. galP of 
Pressives (34%); indeed, in their 5? oi 


. i! 
neurotics aths d 
showed (49%) and psychop er sto! 
a greater incidence. Ot yO" p 

ave found that the incidence % etw? 


homes in the history of neurotics 1$ 
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A (Brown & Moore, 1944) and 30% 
bel Ces & Hardy, 1947; Wallace, 1935), 
fs erard and Siegel (1950) found no par- 
iy s incidence of broken homes in the fam- 
y history of their sample of schizophrenics. 


Psychiatric Interview 


hin — classical method of obtaining in- 
patients D ega ang the individual with whom 
views vii been living is by having inter- 
mother. chil them directly. The quality of the 
m it d relationship is then inferred 
research at the interviewee says. From this 
(Desne an overwhelming number of studies 
ertin, i 1951; Gerard & Siegel, 1950; 

night. oe Hajdu-Gimes, 1940; Kasanin, 

Fleck k T Sage, 1934; Lidz, Cornelison, 
Corneliso erry, 1957a, 1957b, 1957c; Lidz, 
idz ‘ban! Terry, & Fleck, 1958; Lidz & 

"etze = Lidz, Parker, & Cornelison, 1956; 
amilia] 949: Walters, 1938) describe a 
Overpr an ag characterized by a dominant, 
and a SELVES but basically rejecting mother 
in meth ineffectual father. Yet the data 
rel D, by Schofield and Balian (1959) 
e a Ai rather than differences in 

atric (ge ies of schizophrenic and nonpsychi- 
f i medical) patients, and the data 
the eck, and Siegel (1950) indicated that 
he arava in their study, according 

t TO of the data gleaned from 
ing, tana received adequate breast feed- 
toilet bs no history of particularly difficult 
lems gi or of obvious feeding prob- 
*Dparent not come from broken homes, and 
ished, A y were not unduly rejected or pun- 
trom th Nother factor which seems to emerge 
Istic of ee is that a dominant character- 
ality Tee life of schizophrenics is a 
chaying inappropriateness of thinking and 

atmosph, which seems to infiltrate the entire 
Lidz et ere (Fleck, Lidz, & Cornelison, 1963; 
Meyers al., 1957b, 1957c; Stringer, 1962). 
that eee Goldfarb (1962), however, found 
diagnosa 28% of the mothers of 45 children 
the a as schizophrenic and only 12% of 
Shira. | Were themselves manifestly 

phrenic, 


Ps 
Ych 5 
ta Evaluation 


Oo ltitude r á 
Wida de Questionnaires. One of the most 


y 
Y us 5 sae : 
sed questionnaires in this area of 
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research has been the Shoben (1949) Parent- 
Child Attitude Survey. The Shoben scale 
consists of 148 items which measure the di- 
mensions of parental rejection, possessiveness 
and domination. From the administration of 
this attitude survey, Mark (1953) and Free- 
man and Grayson (1955) reported signifi- 
cant differences in attitudes toward child 
rearing between mothers of schizophrenics 
and mothers of normal children. In compari- 
son with the mothers of the control subjects 
the mothers of schizophrenic patients (Mark, 
1953) were revealed as inconsistent in iheir 
methods of control. They described them- 
selves as being, at times, overrestrictive and 
controlling of behavior, but in some instances 
lax. They frowned on sex play and tended to 
keep information regarding sex from their 
children; they also seemed to frown on friends 
for their children. Their relationship to their 
children appeared inconsistent; they de- 
scribed what could be interpreted as excessive 
devotion and interest in the child’s activities 
while at the same time revealing a notable 
degree of “cool detachment.” Freeman and 
Grayson (1955) found that in comparison to 
mothers of students in an undergraduate 
course, mothers of 50 hospitalized schizo- 
phrenics (ages 20 to 35) tended to reveal 
themselves to be somewhat more possessive, 
but inherently rejecting of their children, and 
particularly disturbed about sexual behavior 
in their children, However, according to these 
same data, the mothers of schizophrenic pa- 
tients did not reveal themselves to be more 
dominant, dogmatic, or inconsistent in their 
attitude than the controls, But most impor- 
tant was the fact that item analysis of these 
data revealed that the attitudes of the moth- 
ers of the schizophrenics and of the controls 
were distinguished on only 14 of the items, 
and then, in general, there was so much over- 
lap that even on these items the statistical 
significance was contributed by a small per- 
centage of each group. Freeman, Simmons, 
and Bergen (1959) included four items from 
the Shoben scale among a larger sample of 
questions posed to parents. These items had 
been derived from a previous study (Free- 
man & Simmons, 1958) and were included in 
the second study because they were the only 
ones in the first study which were found to 
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discriminate between the attitudes of moth- 
ers of schizophrenic patients and those of 
mothers of normals. The items are: 


1. Parents should sacrifice everything for 
their children. 

2. A child should feel a deep sense of obli- 
gation always to act in accord with the wishes 
of his parents. 

3. Children who are gentlemanly or lady- 
like are preferable to those who are tomboys 
or “regular guys.” 

4. It is better for children to play at home 
than to visit other children. 


Freeman et al. (1959) found no capacity for 
these items to differentiate the attitudes of 
the mothers of schizophrenics from those of 
other individuals with severe functional dis- 
orders. 

Zuckerman, Oltean, and Monashkin (1958) 
utilized another attitude scale, the Parental 
Attitude Research Inventory (PARI, devel- 
oped by Schaefer and Bell in their work at 
the Psychology Laboratory at NIMH). The 
PARI was administered to mothers of schi 
phrenics, and it was found th a 


hrenics, at only one it 
distinguished between their attitudes ani 


al children. The 
ended to describe 
ter than did the 
nic children, 


t e 
expected by chance alone), mnerceed. that 
Projective Tests, Sever 


1963; Prout & White, se © Fats, 
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degree of immaturity (Winder & Kantor, 
1958) and the use of defences which are 
essentially reality distorting, namely, denial 
and projection (Baxter et al., 1963) How- 
ever, Prout and White did find more purè 
color without form and less human and ani- 
mal movement and shading responses in the 
Rorschach protocols of mothers of schizo- 
phrenic boys as compared to the mothers ae 
comparable group of boys randomly pe 
from the community, Perr (1958) found ri 
the parents of schizophrenic children gavè m 
sponses to the Thematic Apperception E 
(TAT) little distinguished from those of E 
ents of normal children, and Fisher, BOY 
Walker, and Sheer (1959) found that m 
TAT and Rorschach protocols of the pee 
of schizophrenic patients were measurab y 
different from those of the parents of er 
chiatric (general medical) patients, bat t o 
were not distinguishable from the protocols rs 
the parents of neurotic patients. The mothers 
n the schizophrenics revealed a higher de 
of perceptual rigidity, greater incidence 
dicators of stateless on the Rorscharas 
and less definitely conceived parental eee 
on the TAT than the mothers of the ET 
Direct Observation of Interpersont tudy 
havior, Attempts have been made to 5° of 
the interpersonal behavior of familion ors 
schizophrenics in vivo; some investig? f 
0 
* The conclusion that the Rorschach protocols, od 
$ 


«gin gull 
o thers of schizophrenics were um me of 
Ana Rorschach protocols of the ™° Kanto” 


(as in the research b inder 

y Winder ^ 
eae Baxter et al, 1963) is an interpretation., ro 
esults made by the present author. In facts ve are 
of these articles, the t pre by 
ages differences, However, in th 
af ak and Kantor, the mean rating 0 d s 
Ra urity of personality development or th 
nisthere of the schizophrenics was 2-89 


ers of icle 
ter et al, the normals, 2.43. In the artic degre? 


ię ? Pe means of the ratings of the 
u 
fe poA oi Psychologically immature d e 
E PERNS ach by the parents of poor Pics a, 
neurotics wc good premorbid schizophtet 19 
Though int) respectively, 19.43, 19.62, % id She 
tistical <j Poth of these investigations Vosen 7 
obtaineg eticance was demonstrated pr dhon 
are Ea means, the actual means, in both gretai 
of poyon to each other that the inter awet 
logically significant differences aliy $i 


groups A istit onl 
niant ae basis of the obtained statistie cond 
ane erences seemed a highly doubt 


of 
es 0 
fens? pid 


an, 
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have gone into the home, others have brought 
the family into a hospital setting and ob- 
Served the interaction between family mem- 
bers for an hour or so at a time, others have 
brought the family into a laboratory setting 
(National Institute of Mental Health) where 
the family lives under actual but known con- 
ditions for months at a time, 

Ti the study of the interpersonal relation- 
ships in the actual home setting, Behrens 
and Goldfarb (1958) observed that the per- 
Sonality of the mother seemed to set the 
sone of the family milieu and that there 
ep to be a direct relationship between 
the degree of pathology that could be seen in 

e family setting and the degree of psycho- 
pathology demonstrated by the child. The 
inn they observed appeared physically de- 
ilat ted and crowded. There was a basic 
ie i lon between the mother and father, and 
and athers were basically passive. Confusion 
at disorganization characterized the family 
inaa Phere, with the family demonstrating 

adequate mechanisms to handle emotional 
ep The intensive observation of one 
195 €r-child interaction (Karon & Rosberg, 

8) yielded the observation that the mother 

3S unempathic, She blocked verbalizations 
hie and tended to live vicariously 
child. igh the child, but her relationship to the 
cons, PPeAred to involve a basic, though un- 

“scious, hostility and rejection. The mother 
oe an _ obsessive-compulsive personality, 
Ce nated the home, and was unable to ac- 
a. herself as a woman. The intensive ob- 

vation of 51 families (Donnelly, 1960) 
blot to confirm this finding. Observing the 
ing wi Child interaction in the home, utiliz- 
nell the Fels Parent Behavior Scales, por 
chi A T that mothers treated a Leder 
childr ifferently than their other nonpsychot x 
Wag ct: To the psychotic child, the cat e 
emp Sc 2erally less warm, less accepting, less 

pathic, more punitive, more controlling, 
Passi More overprotective. The father was 
relati €, but more rational than the mother in 
teng °n to the child. Psychotic children 
less ae come from homes aoa 
Sociabij r adjusted, full of discord, a lom a 
intera p. > However, in comparing the ily 
thop ction of schizophrenic patients wi 

© of normal controls, both Perr (1958) 


and Meyers and Goldfarb (1961) found little 
that could stand as a valid measure of dis- 
tinction between the two groups of families, 
Perr found that the parents of schizophrenics 
tend to show more self-deception and to de- 
scribe themselves as being more hostile. Mey- 
ers and Goldfarb found that the mothers of 
schizophrenic children appeared less capable 
of formulating a consistent definition of the 
world for the child. 

A method of directly assessing the inter- 
personal behavior of husband and wife was 
introduced by Strodtbeck (1951). He posed 
questions to each parent individually, then 
he brought them together and had them dis- 
cuss those points where their attitudes dif- 
fered. Farina and his associates (Bell, Gar- 
mezy, Farina, & Rodnick, 1960; Farina, 
1960; Farina & Dunham, 1963) utilized this 
method to study the families of schizophrenic 
patients. The questionnaire they used was the 
PARI. They found that they could distin- 
guish the interpersonal behavior of schizo- 
phrenics otherwise described as having good 
or poor premorbid adjustment. In these stud- 
ies, mother dominance was discerned in the 
families of the poor premorbid group only, 
with interpersonal conflict greatest in that 
group. In comparing the family interaction of 
the schizophrenic patient with those of nor- 
mal controls, Bell et al. (1960) found that 
in the family constellation of normals, author- 
ity tended to be shared by both parents, and 
parental conflict was at a minimum, although 
even here there was a trend towards maternal 

inance. 
en (1951) reported a method of study- 
ing the mother-child interaction under live, 
yet controlled, conditions. The mother and 
“hild were brought into a play room where 
the interpersonal behavior was observed di- 
ly, In 1954, Bowen introduced the prin- 
nee t chnique to the study of fami- 
ciple of m rei, Families were brought 
lies of ae pos to be known as the Family 
into what € National Institute 
Section of NIMH ( i 
Study d there they were ob 
of Mental Health), an 1 but known condi- 
served living under actual bu ths-2 

‘ g periods of time (6 months 
tions for long per! based on families living 

. Observations base! 
thers these conditions revealed that the 
ESS of schizophrenics showed extremely 
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domineering, smothering, close relationships 
with the child (Dworin & Wyant, 1957), with 
the mothers utilizing threat of deprivation to 
control the child. Bowen, Dysinger, and 
Basamanie (1959) observed the Presence of 
marked emotional distance and intense con- 
flict between the parents. The fathers were 
emotionally immature and unable to define 
their role in the family and unable to make 
decisions; the mothers were usually the domi- 
nant ones, affecting a close relationship with 
the child to the exclusion of the father, 
Brodey (1959) found that the behavior of 
the families of schizophrenics was character- 
ized by a selective utilization of reality, par- 
ticularly the use of externalization, and that 
the interpersonal relationships were highly 
narcissistic. 

Perception of Parental Behavior by Pa- 
tients. Several studies have indicated that 
schizophrenics tend to have experienced their 
mother as having been rejecting (Bolles, 
Metzger, & Pitts, 1941; Lane & Singer, 1959; 
Singer, 1954), and dominant, demanding, and 
overprotective (Garmezy, Clarke, & Stock- 
ner, 1961; Heilbrun, 1960; Kohn & Clausen, 
1956; McKeown, 1950; Reichard & Tillman, 
1950; Schofield & Balian, 1959). However, 
when one compares the perception of their 
mothers by normals (Garmezy et al., 1961; 
Heilbrun, 1960; Lane & Singer, 1959; Singer, 
1954) the uniqueness of these attitudes to- 
ward the mothers of schizophrenics disap- 
pears. Recollections of dominance and over- 
protectiveness are common for both schizo- 
phrenics and normals, Although Heilbrun, 
Garmezy et al., and Bolles et al. report data 
which have shown that there is a greater in- 
cidence of a feeling of having been rejected 
on the part of a group of psychiatric patients 
when compared to medical-surgical controls, 
the actual incidence of this even in the psy- 
chiatric group was only 15% as compared to 
1% in the controls, Moreover, Singer and 
Lane and Singer found that Perception of 
Parental relationships during childhood was 
more a function of the subjects’ socioeconomic 
level than was Psychopathology 
finding by Opler a 
terns (parental domi 
a function of cultur 
Trish origin) rather 


` mother during 
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NEvuROSIS 


As compared to the research in the area ak 
schizophrenia, investigations of the Syna 
of the family life of neurotics are few an 
generally restricted to data gleaned from pa 
histories. From these studies it appears m 
the neurotic behavior of the child is a a 
function of the neurotic behavior of oe 
mother (eg., Fisher & Mendell, 1956; Tng- 
ham, 1949; Sperling, 1949, 1951; Zimmer 
man, 1930). Neurotic behavior in children 

as been seen to have been related to ma- 
ternal overprotection (Holloway, 1931; ie 
cobsen, 1947; Zimmerman, 1930), maternal 
domination (Mueller, 1945), maternal re- 
jection (Ingham, 1949; Newell, 1934, 1936; 
Silberpfennig, 1941), ‘separation from the 
the first 3 years of life 
(Bowlby, 1940; Ribble, 1941), and oral dep- 
rivation (Childers & Hamil, 1932), Neurotic 
involvement with the mother, where the 
mother needs the child for the satisfaction 0 
her own needs and discourages the develop- 
nal separation between the 


matic disorders (Miller & Baruch, 


1950; 
Sperling 1949) and school phobia (e.g. 
Davidson 1961; Eisenberg, 1958; Estes, 
aylett A 


Neurosis in children has also been associ- 
ated with such factors in the hom 
(Brown & 


(1935), however, found that 75% 
dren from broken essentially 


; 16% were described as conduct 


disorders and only 97% were classifiable as 
Personality Problems, 
„Of did not use the case 
collection, McKeown 


ding, antagonistic, and 
ndards for them 
4) found that neurotics 
hemselves as having been 
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rejected, particularly as compared to the 
Perception of their family life held by normals 
(Bolles et al., 1941). Although Kundert 
(1947) found that whether justified by ex- 
perience or not (e.g., separation due to hos- 
Pitalization of mother or child), emotionally 
ed children, in general, fear being 
eserted by their mothers and cling to them 
compulsively. The Rorschach protocols of 
Mothers of neurotics reveal that they tend to 
Mize psychological mechanisms which ab- 
‘sate reality, for example, denial and pro- 
Jection (Baxter et al., 1963). 


BEHAVIOR DISORDER 


sa research on the family background of 

‘viduals whose personality problems take 
i: form of antisocial behavior is scanty. 
a ca McKay (1932) found no differ- 
cass in the incidence of broken homes from 
eno referred to Cook County Juvenile Court 
cM, ) as compared to a random sample of 
Coe in the Chicago public school system 
bed Behavior disorders in children have 
eha, Cen to have been related to neurotic 
a in their parents (Field, 1940; 
ea 1941), primarily involving maternal 
Social Tez overt and covert. In line with a 
inding wins hypothesis, another interesting 
ety ng is that a correlation has been found 
oo antisocial behavior in children and 
Soci hildren’s perception of parents’ anti- 
lande behavior (Bender, 1937; K. Fried- 

er, 1945; Williams, 1932). 


L DISCUSSION 
Can St us now summarize what conclusions 
Nate th drawn from these data which ce 
Ment € role of the family in the deve p 
families psychopathology. As regards the 
of „€S Of schizophrenics, from an overview 
Pattern research which has investigated = 
Patho] of parent-child interaction of this 
g CN group considered without p 
oup ° any other pathological or contro! 
eee factors emerge which seem to 
Metho, Te this group, regardless of the 
b ca of data collection, that 1s, „whether 
a history, interview, psychological test, 
rhe observation, Families of schizo- 
Who „o3 Seem to be characterized by mothers 
= dominant, fathers who are passive, 


and considerable family disharmony. The 
mother is overprotective, overpossessive, and 
overcontrolling, yet basically, albeit uncon- 
sciously, rejecting. These mothers frown on 
sex, are inconsistent in their methods of 
discipline, and introduce modes of thinking, 
feeling, and behaving which are not reality 
oriented. In light of the fact that these 
patterns emerge as a function of almost all 
methods of data collection, these results seem 
very impressive. Had our review of these 
data stopped here, we would have had ap- 
parent verification of the thesis that certain 
kinds of mother-child relationships and fam- 
ily atmospheres indeed account for the devel- 
opment of schizophrenia in the offspring. 
However, when each of these parental char- 
acteristics is compared with those which 
emerge from the analysis of the family situ- 
ation of the normal (apparently nonpsychi- 
atrically-involved) individual, each character- 
istic that is found to be typical of the fami- 
lies of schizophrenics is found to exist in the 
families of the controls as well. Furthermore, 
research which has attempted to make direct 
comparisons between the families of children 
in different categories of psychopathology 
(e.g., Baxter et al., 1963; Fisher et al., 1959; 
Frazee, 1953; Freeman et al, 1959; D. 
Friedlander, 1945; Inlow, 1933; McKeown, 
1950; Oltman et al., 1952; Pollack & Malz- 
berg, 1940; Pollack, Malzberg, & Fuller, 
1936) reveals no significant or consistent 
differences in the psychological structure of 
the families. G 
The results are the same with regard to the 
milies of the neurotics as well. At first 
ars that the mother’s neurotic 
involvement with the child is causally associ- 
ated with the neurotic behavior of the child. 
However, the essential characteristics of 

n involvement—maternal overprotective- 
this 1m ination, maternal rejec- 
ness, maternal dommatol, i d the 
tion, deprivation and en = = 
mothers fostering an almost 2 fet chil- 
tionship between themselves an co oani 
dren—are basically the same aker 


i i d of chil- 
i ‘ies of schizophrenics an i 
se e beh disorders. Moreover, 1n 


with behavior ; 
eit respects, it would be hard, on bling 
anaes to distinguish the family which pro- 
ee an emotionally disturbed child from 
uc 3 


fa 4 
glance, it appe 
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that which produced the so-called normal or 
well-adjusted child. 

It seems apparent that the major con- 
clusion that can be drawn from these data 
is that there is no such thing as a schizo- 
phrenogenic or a neurotogenic mother or 
family. At least these data do not permit of 
the description of a particular constellation 
of psychological events within the home and, 
in particular, between mother and child that 
can be isolated as a unique factor in the 
development of one or the other kind of per- 
sonality disorder. If one is looking for the 
factor to account for the development of 
neurosis or schizophrenia, that one factor 
does not appear to exist as a clear cut finding 
in the research. 

It is incumbent upon us to wonder why the 
research literature does not permit support 
of a hypothesis regarding parental influence 
on the psychological development of children 
in the manner we hypothesized. One of the 
major problems with which we must contend 
is that human behavior is a very complicated 
event, determined by many factors, and not 
clearly understood out of the context in 
which it occurs, and, in this regard, not 
everyone reacts in a like manner to similar 
life experiences. For example, strict discipline 
is reacted to differently when this occurs 
in a “warm” or “cold” home atmosphere 

(Sears, 1961); maternal rejection is reacted 

to differently where the father is accepting 

and warm (McCord et al., 1961) as well as 
where the father can be a buffer between the 
child and the overprotective mother (Witmer, 

1933), Emphasizing the multivariate aspect 

of the determinants of behavior, one notes 

that Madow and Hardy (1947) reported 
that of the soldiers who broke down with 
neurotic reactions there was a high incidence 
of those coming from broken homes. Amongst 
those soldiers who did not break down, the 
incidence of coming from a broken home 
was 11-15%; the incidence of broken homes 
in the history of soldiers who did break 
down was 36%. Statistically, there is a sig- 
nificant difference between these percentages; 
however, even the 36% datum leaves 64% 
of the soldiers who broke down mot coming 
from a broken home. Huschka (1941) re- 
ported that the incidence of neurotic mothers 


of problem children is high (42%); how- 
ever, this leaves 58% of the group not 
accounted for by this factor. Brown ang 
Moore (1944) commented that the incidence 
of excessive poverty, drunkenness, and fam- 
ily conflict in soldiers who broke down 
was significant, but this accounted for only 
20% of the cases. Although between 30% 
(Canavan & Clark, 1923; Lampron, 1933) 
and 40% (Huschka, 1941; Lidz & Lidz, 
1949) of the children born to mothers who 
are psychotic become psychotic themselves) 
these percentages do not account for the 
majority of children born to these mothers. 
Indeed, Fanning et al. (1938) found ae 
43% of the children born to mothers wh? 
were psychotic were observed to be making 
an adequate social and personal adjustment; 
only 11% of that sample of children was not 
It should be noted that only half of me 
samples of mothers studied by Desp a 
(1938) and Clardy (1951) resembled the 
traditional pattern of what has come tO Hi 
known as the “schizophrenogenic mother- 
Finally, Beres and Obers (1950) observe 
that there is a wide reaction to an experienc? 
of emotional deprivation (in this instanc® 
institutionalization) ranging from the de 
velopment of a schizoid personality to schizo- 
phrenia itself, and including neurotic re4° 
tions and character disorders, Indeed, 25% ° 
their sample of children who were brought 
up ìn institutions appeared to be making # 
satisfactory adjustment in spite of this 
ostensibly devastating experience, 

Over and above the complexity of human 
behavior contributing to the inconclusivenes> 
of the results, one must look at the way in 
which these data have been collected. It 
might be that the criterion measure, that is; 
the diagnosis, did not provide the investi- 
gator with meaningful groupings of subjects 
so that consistent findings could be obtained: 
As regards the method of data collection 
Case histories may be inadequate in pro" 
viding basic data; Information can be gross 
and/or inaccurate; The informant has to relY 
on memory, and this memory might be ©?” 
sciously or unconsciously selective, oF ee 
informant might not be aware of the imp° 
of or feel shame in giving certain data. ie 
despite the many limitations of this m° 


ROLE OF THE FAMILY 


of data collection, some of the primary re- 
search in schizophrenia has utilized this 
method, and almost all of the data with re- 
spect to the family life of neurotics and 
behavior disorders were gathered in this way. 
These same limitations apply to data that are 
gathered when the informant is asked to fill 
out an attitude questionnaire. Surely the 
data on parents elicited from the children are 
Susceptible to distortion even when given by 
normal children, no less those who already 
tend to consciously or unconsciously con- 
found their perception of reality with fantasy. 
hg psychiatric interview is a much more 
Sensitive procedure than the case history oF 
ooo questionnaire. Either structured or 
BR -ended interviews enable the interviewer 
infi olo up leads and possibly detect where 
oF ormation is being omitted for one reason 
th: another. The problem here, however, 1S 
nat there is always the possibility that 
'storted or inaccurate data are gathered by 
© interviewer, either through the kinds of 
goths asked, or the perception of the 
~'swer or of the individual being interviewed. 
alth example, it is interesting to note, that 
vi ough the majority of psychiatric inter- 
ie experienced the mothers of schizo- 
T nics as matching the model of the 
Ove; Zophrenogenic mother—the dominant, 
"Protective, but basically rejecting mother 
% induces inappropriateness of thinking in 
ion children—the psychological test evalua- 
Cong of mothers of schizophrenics failed a 
is rm these findings. One explanation = 
quainte that the interviewer, already nod 
Moth ed with the literature: regarding ca 
Derien, of schizophrenics, anticipating Te 
P the mothers in terms of the A 3 
De - Schizophrenogenic mothers, did, indeed, 
perience them in that way, whereas a more 

i lective evaluation of the patterns of think- 
and feeling of these mothers did not 
han the more subjective impression. r) 
n ar to try and avoid the pitfalls in- 
T int in data gleaned through case Histo 
Bie es investigators hypothesized tha 
acti observation of the mother-child inter- 
valid information. 

mitations inherent 


confi 


BR re: 
Unfort might yield more 
in nately, here, too, li : 
© mode of data collection become aP 


Da, ; 
tent. Observations of the mother-child 
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interaction in the home or in an observation 
room in a hospital or clinic are generally 
restricted to a limited time segment, for 
example, 1 hour once a week. This factor, 
in and of itself, limits the observations to 
a fairly restricted aspect of the spectrum of 
the interaction between mother and child. 
Here, too, the behavior to which the observer 
is exposed may be influenced by the con- 
scious or unconscious attitudes and motives 
of the parent being observed. It is not too 
difficult for the parent to present only that 
behavior which, for one reason or another, 
she feels it safe to display and to control the 
presence of other behaviors. Direct observa- 
tion of the family for extensive periods of 
time, that is, months, and under controlled 
but as natural as possible living conditions 
(as in the Family Study Section of NIMH) 
avoids the restrictiveness and overcomes, to 
one degree or another, the artificiality of the 
relatively brief observation. However, the 
family is still aware that they are being ob- 
served and may, to one degree or another, 
be unable to act “natural.” Moreover, unless 
the observations are independently made by 
several people whose reliability of observa- 
tion has already been established, they may 
also be influenced by the Zeitgeist and per: 
ceive the family as being “schizophrenogenic” 
whether it is or not, mutually reinforcing 
each other’s expectations. A more pressing 
consideration in evaluating the validity of 
these kinds of observations 15 the fact =a 
the interaction, no matter how natural, a es 
place after the development of the psyc a 
pathology. It is quite possible that peste 
pects of the interpersonal relationship Fauld 
he family, or between the mother and chi 
tne #ar T ntuated in the develop- 
in particular, that eventuate? "i ine, aii 
ment of the patterns of thinking, E 4 on 
behaving characteristic of the sc ate i 
he neurotic, are nO Jonger present; they 
ey urred at a time of the child’s 
Cee past and/or under conditions 
ot even accessible to the ob- 
jg no reason to assume that 
- logical factors are still functioning 
available to the trained 
server even over the course of 6 months. of 
Qns e, it might be that whatever differenti- 
ae he psychological existence of the schizo- 
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phrenic from that of the neurotic or of the 
normal might be so subtle that it is imper- 
ceptible to the participants themselves or 
even the trained observer and, hence, escape 
notice. Here, one is reminded of Freud’s 
comment that “the years of childhood of those 
who are later neurotic need not necessarily 
differ from those who are later normal ex- 
cept in intensity and distinctness [Freud, 
1938 (Orig. publ. 1910), p. 583].” 
Theorizing about the etiology of psycho- 
pathology has characteristically been of the 
either/or variety. Nineteenth century scien- 
tists sought explanations for neurotic and 
psychotic disorders in the hereditary back- 
ground of their patients, working from the 
assumption that many directly inherited the 
neurotic or psychotic “illness.” On the other 
hand, the scientist of the twentieth century 
has sought explanations for psychopathology 
in the experiential aspect of man’s life, in 
his emotional and interpersonal learning. As 
with most events in our life, the truth is 
probably somewhere in between these two 
positions. Indeed, in spite of the emphasis 
that is placed on the role of experience in the 
development of personality in psychoanalysis, 
Freud did not think, at least as regards the 
etiology of psychopathology, in such cate- 
gorically black and white terms. He was able 


to bridge the gap between the nature-nurture 
extremes: 


We divide the causes of neurotic disease into those 
which the individual himself brings with him into 
life, and those which life bring to him—that is to 
say, into constitutional and accidental. It is the 
interaction of these as a rule first gives rise to illness 
[Freud, 1950b (Orig. publ. 1913), p. 122]. 


Let us bear clearly in mind that every human 
being has acquired, by the combined operation of 
inherent disposition and of external influences of 
childhood, a special individuality in the exercise of 
his capacity to love—that is, in the conditions which 
he sets up for loving, in the impulses he gratifies 
by it, and in the aims he sets out to achieve in it. 
... We will here provide against misconceptions 
and reproaches to the effect that we have denied the 
importance of the inborn (constitutional) factor be- 
cause we have emphasized the importance of in- 
fantile impressions. Such an accusation arises out 
of the narrowness with which mankind looks for 
causes, inasmuch as one single causal factor satisfies 
him, in spite of the many commonly underlying 
the face of reality. Psycho-Analysis has said much 
about the “accidental” component in aetiology and 


little about the constitutional, but only because ae 
could throw new light upon the former, ane p 
the latter it knows no more so far than is a 
known. We deprecate the assumption of an essential 
opposition between the two series of aetiologica 
factors; we presume rather a perpetual interchange 
of both in producing the results observed [Freud, 
1950a (Orig. publ. 1912), p. 312]. 


Other psychoanalysts have followed Freud in 
the presumption of an inherent, predeter- 
mined characteristic functioning of the nerv- 
ous system of the human organism which 
determines reactions to stimuli pre- and post- 
natally (e.g., Greenacre, 1941). ; f 

Augmenting the clinical observations g 
psychoanalysis, one must juxtapose the eRe 
perimental evidence in psychology which indi 
cates that (e) individuals reflect character” 
istic patterns of autonomic activity which arè 
stable and which are typical of them as 
individuals (Grossman & Greenberg, 1957; 
Lacey, 1950; Richmond & Lustman, 1955; 
Wenger, 1941), (b) the characteristic pat 
terns of neural activity are identifiable Pre 
natally and are consistent with the patterns 
of activity observable postnatally (Richards 
& Newbery, 1938, (c) these characteristic 
patterns of autonomic activity consistently 
emerge in a factor of lability and balance 1” 
which specific personality factors are CO” 
sistently highly loaded (Darling, 1940; EY 
senck, 1956; Eysenck & Prell, 1951; Theron, 
1948; van der Merwe, 1948; van der Merwe 
& Theron, 1947), (d) there is greater siml- 
larity of autonomic reactivity between iden- 
tical twins than fraternal twins or ordinary 
siblings (Eysenck, 1956; Eysenck & Prell, 
1951; Jost & Sontag, 1944), and (e) there is 
a selective influence on Personality function- 
ing due to the sex of the individual per se. 
For example, generally boys outnumber girls 


2-1 in being referred for Psychological help 
(Bender, 3 


1937; Wile & Jones, 1937), Sears 
(1961) found a significant difference in the 
basic mode of self-reported expr 


: ession of ag 
gression between boys and girls: Girls aP 
peared higher 


in socially acceptable forms 
of aggression and high in anxiety regarding 
hostility, while boys were significantly highet 
in aggression that was directed against socla 
control. Sears also found that the more puni 
tive the mother is, the more dependent the 
son becomes but the less dependent 
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daughter becomes. Newell (1936) found that 
maternal rejection affected males more than 
females: Marked increase in aggressive be- 
havior was noted in the boys who experienced 
rejection, not so with the females. Baruch 
Rs Wilcox (1944) noted that interparental 
tensions lead to different reactions in boys 
aS compared to girls; in boys, it led to 
ascendance-submission problems, in girls, to 
an experience of lack of affection. 

We end this survey by concluding that we 
“ave not been able to find any unique factors 
= the family of the schizophrenic which dis- 
tinguishes it from the family of the neurotic 
i from the family of controls, who are 

Stensibly free from evidence of patterns of 
ay Psychopathology. In short, we end by 
is a that the assumption that the family 
ity Me factor in the development of personal- 

as not been validated. It is interesting 
C note that Orlansky (1949), in his review 
AA literature exploring the relationship 
are, certain childhood experiences, for 
Sacer e feeding, toilet training, thumb- 

ing, the degree of tactile stimulation by 
persoo other, etc., upon the development of 
ei characteristics, was also forced to 
Beas that the data failed to confirm an 
in nee relationship between the experience 
Cours, ancy and the resultant personality. Of 

e ©, it might well be that the reality of 

in qc ™ily is not the important dimension 

termining the child’s reactions; rather, 
femte be the perception of the family 
r = and this might often have little 
ar relation to the people as they really 
'S would mean, then, that in many 
S the important variables in the de- 
tors th of psychopathology might be fac- 
t u ich the child brings to the family, 
nctioning of the nervous and metabolic 
e and the cognitive capacity to inte- 
Ce sare into meaningful perceptual and 
Wo a= schema, Indeed, we are left to 

a as do the psychoanalysts, whether 
reality livity towards fantasy distortion of 
Velopm, might not be the factor in the de- 
livity ent of psychopathology, and this pro- 
the dae not be always determined by 

vio S experiences per se. ; 
By usly, questions regarding the etiol- 
Patterns of personality behavior which 


Mstan ce. 
Yelopme 


Ste 
Srat 
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are regarded as pathological, unadaptive, or 
unadjusted cannot be met with simple an- 
swers. Apparently, the factors which play a 
part in the development of behavior in 
humans are so complex that it would appear 
that they almost defy being investigated 
scientifically and defy one’s attempts to draw 
meaningful generalizations from the explora- 
tion which has already been done. It is, of 
course, conceivable that human behavior is 
so complex that it cannot be reduced to 
simple terms or be expected to yield unalter- 
able patterns of occurrences. It might also 
be that what produces psychopathological re- 
actions in one individual does not in another, 
All this would be understandable in light of 
the complexity that is the human being, 
neurologically as well as socially, but it is 
unfortunate as regards research endeavors, 
In 1926, Freud wrote: 


Anxiety is the reaction to danger. One cannot, after 
all, help suspecting that the reason why the affect 
of anxiety occupies a unique position in the econ- 
omy of the mind has something to do with the 
essential nature of danger. Yet dangers are the 
common lot of humanity; they are the same for 
everyone. What we need and cannot lay our fingers 
on is some factor which will explain why some 
people are able to subject the affect of anxiety, in 
spite of its peculiar quality, to the normal workings 
of the mind, or which decides who is doomed to 
come to grief over the task [Freud, 1936, p. 64]. 


We end this review of forty years of research 
without being able to feel that we are any 
closer to an answer than was Freud. 
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FUNCTIONAL AND ADAPTATIVE ASPECTS OF CELLULAR 
REGULATORY MECHANISMS 


W. W. MEISSNER 
Harvard Medical School 


Recent developments in biochemical genetics, namely, the understanding of the 
mechanisms of regulation of cellular processes through the operation of 
regulator genes, brings the genome into intimate involvement in the adaptative 
processes of the organism. The dichotomy of structure and function is begin- 
ning to dissolve in a deeper understanding of the function of the organism. 
Available evidence indicates that similar regulatory mechanisms are operative 
in man which control the integration of hormonal activity and serve as the 
organic basis for the correlation of structure and function, even in higher 
cerebral functions related to emotional and learning behavior. 


The deepening scientific understanding of 
basic cellular processes has reached a point 
jn recent years where an understanding of 
such subcellular mechanisms can begin to 
contribute to our deeper understanding of 
higher functions of the organism. The cur- 
rently developing understanding of basic 
processes has opened avenues of thinking and 
research which strike much more closely at 
the heart of the more strictly functional as- 
pects of the behaving organism. In some 
radical sense, the lines of division between 
structure and function are beginning to fade. 
The possibility presents itself on many fronts 
of linking various aspects of function with 
basic cellular processes. It may be possible to 
reach some understanding of the effect of 
functional adaptations of the organism, like 
learning, or functional maladaptations, like 
schizophrenia, in relation to basic processes. 

One of the most significant areas in which 
our understanding has approached this level 
is in the exploration of cellular regulatory 
mechanisms. A new understanding of the re- 
lation between cellular functioning and struc- 
tural change has been developed in the last 
few years, based on an appreciation of the 
role of genetic mechanisms in controlling cel- 
lular responses. Much of the basic under- 
standing of these mechanisms was achieved 
in simple cellular systems like the bacteria, 
most notably Escherichia coli. A review of 
these cellular mechanisms, therefore, will help 
to provide a clearer understanding of their 
relevance and significance for more strictly 


"E roc” 
functional and specifically psychological p 
esses. 


3 ich 
Over the course of the many years 19 hig - 
students of human behavior have pech jr 
vestigating genetic aspects of behavior, gt- 
general orientation toward their subject se 
ter was principally dictated by the lin etics 
thought laid down by the science of Ze% e- 
itself. Under the impact of Mendelian 
netics, psychologists and psychiatrists 
centrated their efforts on the determi” c 
of the genotypes corresponding to the py 
logical phenotypes. The overriding ©"; 
was to identify the genetic factors t0 W j- 
the observed variance in psychological 4/10 
ties and performance could be ascribed- of 
formulation of the notion of inborn ert" 1. 
metabolism (Garrod, 1923) and the ete 
lishment of a link between genic determi ys 
tion and enzyme synthesis gave the stimt ic 
to the tremendous effort to specify gene 
factors in the etiology of mental illness. 
These are the interests which domi”? 
contemporary thinking about behavioral 2, 
netics. A common factor in this general Fs 
proach has been the operative conceptio” en 
the genetic material as a hereditary She 
which determines the basic structure °% uc’ 
organism. The genetically determined $ eit 
ture is then regarded as capable of, op? 
modified to one degree or other, oT mot 
fashion or other, by a variety of €% of 
mental influences. This basic conceptio p 
the gene as somehow related to structu «ine 
underlain the whole course of our 


atio” 
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about the relationship between heredity and 
environment, The hereditary (genetic) factor 
Was regarded as responsible for the structure 
of the Organism, whether macroscopically in 
terms of gross architecture of the body, or 
ener oscopically in terms of cell structure or 
ana En molecular structure of enzyme sys- 
modif =e given hereditary structure was then 
TENS developmentally and functionally by 
wae nmental influences. Consequently, it 
ieee to approach the study of the 
iG tee ton between heredity and environment 
could A of the amount of variance which 
e attributed to each. 

© identification of deoxyribose nucleic 
NA) as the genetic material (Avery, 
d, & McCarthy, 1944) gave rise to a 
eluci dae of biochemical genetics. With the 

a of the structure of DNA by 
impetus and Crick (1953), a tremendous 
chemic TE given to the study of the bio- 
informati bases for the transfer of genetic 
mechani lon. Little by little the picture of the 

S tr Sms by which the genetic information 
chro; ee from DNA in the nuclear 

to specific proteins in the cellular 
anster m was pieced together. The role of 
Ynthes ribonucleic acid (t-RNA) in protein 

ht ra (Hoagland, Stephenson, Scott, 
e Zamecnik, 1958), the clarification 

Tole of messenger ribonucleic acid (m- 
a Protein synthesis (Brenner, Jacob, 
1961). SOn, 1961; Nirenberg, & Mathaei, 
Benetice and the slow progress in breaking the 


aciq ( 
MacLeo 
new er 


Wat o (Crick, Barnett, Brenner, & 
196 Obin, 1961; Jones & Nirenberg, 
ingly p aS given us a consistent and increas- 


duces more accurate idea of how the cell pro- 
aciq F Toteins out of the supply of amino- 
Counti Vailable to it, A clear and concise re- 
cent] & of this whole development has 
The p Peen Provided by Beadle (1964). 
the elu ole development was directed toward 
chemic q dation, on the molecular and bio- 
the e „level, of the mechanisms inherent in 
tiñe geant structural orientation of scien- 
Morg n tics, The DNA template was still 
phic % less regarded as a structural given 
2 tonjyp ed, by the intermediates of m-RNA 
Action tHon with the RNA in the ribosomal 

> '0 give rise to the structural units 
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out of which the organism was constructed, 
the proteins. To this extent, then, recent 
developments in biochemical genetics have 
not altered the basic structural-functional 
dichotomy which has governed our thinking 
about behavioral genetics. 

While rapid strides were being made in 
investigating the function of DNA and RNA, 
other developments were being made which 
seem quite likely to revolutionize our thinking 
about the operation of genetic factors in the 
adaptation of the organism. As soon as it had 
become clear that DNA was directly in- 
volved in protein synthesis, attention was 
inevitably directed to the question of the 
regulation of protein synthesis. For it was 
quite clear that if all of the genes in a given 
cell were to produce protein at about the 
same rate, an extremely unwieldy situation 
would soon develop. The cell in fact varies 
the rate of protein production under differing 
conditions. The continued production of un- 
needed proteins would prove to be very un- 
economical (Zalokar, 1961). Furthermore, 
when the necessity arises to produce a given 
protein in greater amounts than usual, it is 
unreasonable to accelerate the total produc- 
tion of RNA in order to produce a sufficient 
amount of the RNA needed for the produc- 
tion of the one protein. Moreover, the organ- 
ism develops from a single set of diploid 
chromosomes in the zygote, Yet in the adult 
animal tremendous ranges of differentiation 
are manifest, both in structure and function 
of the respective cells. Clearly, some mecha- 
nism must be provided whose function is to 
regulate the sequential activation of the gene 
potential during the different phases of the 
organism’s life. 

Once the problem of cellular regulation had 
been brought into focus, it became abundantly 
clear that the cell does in fact alter the 
rates at which various protein products are 
produced. Most of the studies of cellular 
control mechanisms have been conducted with 
bacteria, primarily the ubiquitous Escherichia 
coli. It was found, for example, that aspartate 
transcarbamylase of E. coli was strongly in- 
hibited by cytidine triphosphate, its bio- 


208 


synthetic product. The reactions can be rep- 
resented in schematic form: 


Carbamyl-phosphate + aspartate i 
— carbamyl-aspartate ——>— Cytidine 
; triphosphate 


Maximal inhibition was achieved with cytidine 
triphosphate rather than cytidine diphos- 
phate or any other closely related compound, 
indicating that the inhibition was quite 
specific. The inhibition was reversed by in- 
creasing the concentration of aspartate (Ger- 
hart & Pardee, 1962). A similar reaction pat- 
tern was discovered for inhibition of L- 
threonine deaminase by L-threonine (Chan- 
geaux, 1961). Both series of reactions had 
certain common characteristics: (a) the in- 
hibited enzyme was the first in the series 
which led specifically to the production of 
the endproduct; (b) inhibition was appar- 
ently achieved by the endproduct itself, 
rather than by some derivative; (c) when 
the level of the endproduct decreased, the 
enzyme proportionally increased the rate of 
production (reversible regulation). Umbarger 
(1961) has proposed two models for this sort 
of feedback inhibition: competitive inhibition 
between substrate and endproduct with 
catalytic and inhibitory sites overlapping; or 
combination of the endproduct with the 
enzyme at a site which does not interfere 
with the binding between enzyme and sub- 
strate. In the latter case, inhibition of 
catalysis would be effected by some sort of 
allosteric effect of the binding of the end- 
product to the enzyme. A considerable 
number of such feedback inhibitions of 
enzyme systems have been demonstrated 
(Umbarger, 1961). 
More significantly, however, other systems 
have been discovered which involve the 
selective activation of enzyme systems and 
the production of proteins. The shift from the 
utilization of glucose to the utilization of 
galactose in the nutrient medium by E. coli 
was found to involve the production of a new 
protein, f-galactosidase (Pardee, Jacob, & 
Monod, 1958, 1959). The increase in activity 
in the aspartate transcarbamylase system was 
also found to be accompanied by selective 
synthesis of a new protein having the same 
electrophoretic mobility as the transcarbamy- 
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lase (Shepherdson & Pardee, 1960a, ae 
Yates & Pardee, 1957). Similar effects V s- 
also demonstrated for the ornithine 1959) 
carbamylase system (Rogers & Novelli, 1 iani 
and for alkaline phosphatase O a 
1960). In each such case of selective ac that 
tion of the enzyme system, it was shown i 
the production of new enzyme was 1? cific 
tion. But how was the production of a spe 
enzyme brought about? t 
There is good evidence to suggest vals 
adaptational effects within bacterial that 
affect specific proteins selectively, a? ro- 
the enzymatic adaptation involves the n 5 
duction of protein from available ae 
(Hogness, Cohen, & Monod, 1955; a o 
& Spiegelman, 1954), The regulation e 
enzyme production must in some eal 
responsive to the functional state of the nust 
Consequently, the activation signal oe , 
derive from the cytoplasm or its ete 
Thus, the presence of galactose in the tion 
terial cell acts as a stimulus to the agoi the 
of the production of B-galactosidase, 4” the 
presence of glutamate in the cell acts 4§ 
stimulus for the production omi 
transcarbamylase. Such substances have tion 
to be known as “inducers” since their 4° 
is the induction of enzyme synthesis. a sagt 
In the DNA-RNA schema for protein Pr, 
duction, the site of operation of such in! ie 
can be either the secondary template ° of 
polyribosomes or the primary template |. 
DNA itself. It is quite possible that in ©", 
ferent organisms and different systems; aa 
or the other site is affected. But there is gor. 
evidence that inducers (and their opposite 
the repressors) are influencing the product” 
of protein at the level of the DNA templat 5 
Such evidence is based on results of co 
tinuous cultivation experiments under J 
pressive conditions (Cocito & Vogel, 19574 
on studies of coordinate repression (Ames of 
Garry, 1959), on studies of the immunitY ti- 
lysogenic cells (Jacob, 1960), and on inve es 
gations into the function of operator Be): 
(Jacob, Perrin, Sanchez, & Monod, ! ort” 
Some convincing evidence has been put "ine 
to suggest that in at least some systema ect? 
significant exchange which reflects the © NE 
of induction takes place between the ~ auc’ 
template and the messenger-RNA. Í 
tion takes place at the RNA level, the? 
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must be continuously present in the cell, and 
induction could be carried out without any 
NA synthesis, Kramer and Straub (1956) 
ound that the destruction of existing RNA 
one did not affect inducibility. 
ge g ribonuclease after the inducer, 
Aken did significantly impair the pro- 
iung Ben new enzyme. It has also been 
Sibi ty 3 prevention of induction was pos- 
synthesi the use of analogs to inhibit RNA 
shai 1s SPiegelman, Halvogørsen, & Ben- 
is the 55). Furthermore, if RNA synthesis 
of a aay effect of induction, the action 
3 cheval necessarily involved. Destruction 
ea Ce DNA after the incorporation of 
induced Oys the cell’s ability to produce the 
enzyme (McFall, Pardee, & Stent, 

). Also Gale and Folkes (1955) were 
om induce B-galactosidase formation in 
OSenized preparation of E. coli; they 
le to suppress synthesis of the enzyme 
“oxyribonuclease, and subsequently re- 
thie process by the addition of bac- 
A. Synthesis of RNA appears to be 
enzyme ee of the inductive activation of 
Ver, th ystems. It has been argued, more- 
RNA a the incorporation of precursors into 
the cong eS that continues throughout 
tii cycle of the cell (Lark & Lark, 
Soma] that the rate of synthesis of ribo- 
Chemica A is more highly responsive to the 
eith environment than the synthesis of 
Praen teins or DNA (Neidhardt & 
Mechanic 1961). It provides a very sensitive 
the me of response to slight changes in 
L961). ‘ronment of the cell (Schaechter, 


he 
avai z x ‘ 
“onside Vailable evidence, of which there is 


r 
to Preseny more than we have undertaken 
wer t here, has been formulated into a 


lonog of cellular regulation by Jacob and 
wate pola, 1961b) and by others. We 
melatory an insight into the nature of 

o bon o oesses by a brief review of the 
Stalis S in the theory. 

Ig to this model of cellular regu- 
b es Synthesis of enzymes is subject to 
ch ian oe control, The structural genes, 
bes tite long been the objects of genetic 
“ign (ON, determine the molecular organ- 
Protein products by specifying the 
Diote aminoacid residues composing 

em structure, Other functionally 


P> 
lation © 
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specialized genetic determinants are neces- 
sary, however, to determine the mode of 
operation of the genetic mechanism. These 
specialized genetic components are the regu- 
lator and operator genes, and their function 
is to control the rate of protein synthesis 
through the intermediate action of cyto- 
plasmic components. Umbarger (1961) has 
offered the following observation regarding 
these genetic components: 


A feature of the chain of events between gene and 
enzyme that has received emphasis in this sym- 
posium is the concept that the presence of a par- 
ticular region of DNA containing the information 
for the amino acid sequence of a particular protein 
is not by itself sufficient condition that a particular 
protein will be formed. The role of the regulator 
and operator as genetically controlled mechanisms 
enabling the structural gene to respond to environ- 
mental changes has been sufficiently well documented 
that any attempt to explain the events of gene ex- 
pression must incorporate these control features into 
the proposed scheme [p. 301]. 


Regulator and operator genes exercise their 
effects through the action of cytoplasmic re- 
pressors. The repressor can either be in- 
activated by the process of induction, or it 
can be activated by the opposed process of 
repression; in each instance, the induction 
or repression is carried out by means of 
special metabolites. 

The system operates directly at the level 
of the synthesis of m-RNA from the DNA 
gene-template. The messenger is produced by 
a replicative sequential process from the 
structural gene. Thus the structural gene 
transfers the encoded genetic information to 
the m-RNA, which then transfers the in- 
formation to the cellular ribosomes determin- 
ing them to produce the specific protein re- 
quired by the genetic code. The transcription 
of the encoding sequence of the structural 
gene is dependent on an initiating point, the 
operator gene. Each messenger may code for 
one or more polypeptides, and a series of 
structural genes involved in the encoding for 
a series of interrelated proteins may be under 
the coordinate control of a single operator. 
Such a unit of transcription is called an 
operon. The operator combines specifically 
and reversibly with the RNA fraction or 
protein which possesses the proper comple- 
mentary sequence of amino acids. The avail- 
ability of the operator locus is essential to 
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the formation of m-RNA, so that the com- 
bination of a repressor substance with the 
operator locus can block the cytoplasmic 
transcription of m-RNA, and thereby diminish 
the rate of protein synthesis, since the rate 
of m-RNA synthesis is the critical factor in 
determining the rate of protein production. 
The function of the regulator gene in this 


| Structural gene for 
l 
| 
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schema is to synthesize the repressor which 
combines with and represses the operator 
locus. : 

As an example of the operation of a 
schema we can consider the iduenon a 
-galactosidase by lactose in the cell mediu $ 
(see above.) The induction process is visua 
izable in diagram form. 


| 
| . 
Regulator Operator 8-galactosidase Genome 
| 
| 
l 
| i | 
Repressoe===] } } }ëųă -RNA 
7 m-RNA 
| 
$ Ribosome 
| i e 
| B-galactosidase Enzyme 
l 
| 
l 
| 
i 
Inducer (Lactose) 


In this case, the repressor substance, pro- 
duced by the regulator gene, inhibits the 
action of the operator gene and thus prevents 
the transcription of m-RNA for the produc- 
tion of g-galactosidase. Absorption of the 
substrate lactose brings the inducer, lactose, 
into contact with the repressor. The inducer 
inactivates the repressor and permits the 
operator to initiate the transcription of m- 


i 
Regulator Operator | 


e a aa 
RNA. The cell thus responds to the presenc? 
of lactose in the cell medium by the iai 
of B-galactosidase formation which ena! 
the cell to utilize lactose and galactose 4 
nutrient, e 
A similar example can be found in i 


case of the production of ornithine tt2”* 
carbamylase, 


“Structural gene for ornithine 


transcarbamylase 
Sau — 2 X Genome 
| 
ji 
| 
| | 
Apo-repressor-———— i To oa SE m-RNA 
== m-RNA 
Co-repressor~————1 t Í 
f | | Ribosome 
l | 
| 
| nn. 4 Enzyme 
| P a T eS 
i | Ornithine transcarbamylase 
| Inducer 
I ! 
| 
| 
i 
EA 
Arginine Glutamate 
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In this instance, arginine is required as a 
co-repressor in order that the apo-repressor 
Produced by the regulator gene be effective. 
Onsequently, when the level of arginine in 
Ne cell is low. 
sible, Synthesis 
creases the acti 


e fi : . 
Substrate glutamate, thus increasing the 


addition, acts as an inducer, 
the production of enzyme by in- 
€ combined action of apo-repressor 
ressor on the operator. 
that hay, ystems are only two of a number 
onsequies reti defined with some security. 
a E Y, they are representative only 
anisms T. of such intracellular control mech- 
lik ly » the full scope of which is more than 
Jacob Tg broad and varied. Moreover, as 
Pointe aie Monod (1961a, 1961b) have 
can be ee not all the points of these models 
Mo: ore with the same confidence. The 
of regulary, established factor is the existence 
Porteg pot Senes, This conclusion is sup- 
affect p Mutations of structural genes which 


ese s 


leave he alteration of protein structure yet 
Conver. € regulatory mechanism unaltered, 
"Bulatop > by mutations which alter the 
Protein > Mechanism but do not alter the 
"eulatoy  Ucture, Tt also seems clear that 
Plasmic * genes can by means of specific cyto- 
Steucty Substances inhibit the expression of 
Tiffen ñl genes, This can be established by 
en tic al Properties exhibited by genetically 
the orig; “Y80tes which depend on the 
that as of the cytoplasm, and by the fact 
Produet absence of the regulator gene or its 


e ro aoe in uncontrolled synthesis of 
e erat at maximal rates. The existence of 
tesso so" aS the site of action of the 
Presso, S deduced from the specificity of 
by tator action, and the identification of the 
= he on a genetic segment is suggested 
ane may stvation that a single operator 
oltcent” Control the expression of several 
wee tng Structural genes, The assumptions 
win ing the interaction of the repressors 
eg Ucers or with co-repressors are the 


4 Dart 
; 1961p) the model (Jacob & Monod, 
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With regard to this latter point, these 
same workers have suggested that the avail- 
able evidence points to the conclusion that 
the active product of the regulator gene may 
in fact be a protein substance, rather than 
the RNA product originally proposed. Several 
lines of evidence support this conclusion: 
(a) The repressor action has a high degree 
of stereospecificity; (b) the interaction with 
the operator is virtually immediate and re- 
versible; (c) genetic evidence shows that a 
single gene, and therefore, presumably a 
single specific macromolecular constituent 
(the repressor) is responsible for the specific- 
ity of the interaction; and (d) single muta- 
tions of this gene abolish the repressor- 
inducer interaction while preserving the re- 
pressor-operator affinity (Jacob & Monod, 
1963). Consequently, Jacob and Monod feel 
that all properties are accounted for if the 
repressor is an allosteric protein which pos- 
sesses two sites, one of which interacts with 
and binds the operator, the other of which 
interacts with the inducer. Consequently, the 
specific effects of small molecule metabolites 
in activating or inhibiting the transcription 
of m-RNA at the gentic level are mediated 
by an allosteric transition of the repressor 
(Monod, Changeaux, & Jacob, 1963). 

The picture which emerges from these 
fragments of fact and theory gives us the 
basis for an entirely different understanding 
of the activity of the genetic material. 
Through the mechanisms mediated by the 
regulator and operator genes, the genetic 
apparatus is brought into continual inter- 
action with the functioning organism. Rather 
than considering the genes in a strictly struc- 
tural orientation, we must begin to think of 
them as continually and dynamically operat- 
ing to regulate the adaptation of the organism 
to its environment. n x 

Considerable care is called for in trying to 
extrapolate present evidence about the genetic 
regulation of cell processes to the function- 
ing of the more highly differentiated and 
more highly organized animal organisms, 
Cellular regulatory mechanisms defined jn 
microorganisms cannot be simply transferred 
to more complex organisms. Higher organ- 
isms must be expected to employ a con- 
siderably wider range of regulatory mecha- 


212 


nisms than are available to the microorganism. 
Yet it is also altogether likely that the basic 
mechanisms defined in the study of E. coli 
can also be found operating at the cellular 
level of adjustment in more differentiated 
organisms. 

Some evidence has already been accumu- 
lated which suggests that similar mechanisms 
are at work in higher organisms. In the en- 
docrine system, it is well established that 
endocrine glands can be activated to increase 
the rate of production of their hormonal prod- 
ucts. The increase of the rate of production 
of some hormones may be an effect of genetic 
control through the activated production of 
m-RNA (Karlson, 1962). Production of 
protein hormones like ACTH might be direct, 
and production of nonprotein horomones 
like the steroids might be mediated by the 
increased production of enzyme systems, The 
thyroid gland, for example, produces thyroxin 
and thyroglobulin. The rate of production of 
these hormones is elevated by the production 
of the thyroid stimulating hormone (TSH) 
from the adenohypophysis. Under the stimulus 
of TSH, the thyroid is activated to increase 
its production. The epithelial cells of the 
follicles increase in size, the cell nuclei and 
nucleoli enlarge and become more prominent, 
and the endoplasmic reticulum shows a 
definite increase. Moreover, active RNA pro- 
duction can be demonstrated by direct 
measurement (Matovinovic & Vickery, 1959), 
This increased thyroid activity increases the 
level of thyroxin in the blood which has a 
feedback effect on the pituitary, suppressing 
the production of TSH. TSH is produced by 
certain basophilic cells in the pituitary which 

are rich in RNA. After suppression by 
thyroxin, the percentage of basophilic cells 
decreases. When the level of thyroxin in 
the blood drops and TSH production is 
stimulated, the percentage of such basophilic 
cells markedly increases (Griesbach & Purves 
1943). The basophils of the stimulated hiv 
pophysis increase in size and the nuclei þe- 
come enlarged, showing several prominent 
nucleoli (Stevens, D’Angelo, Paschkis, Can- 
terow, & Sunderman, 1955). This would 
suggest that TSH acts in the manner of an 
inducer on the hormone producing mecha- 
nisms of the thyroid gland, increasing the 
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production of thyroid hormones. Similarly, 
thyroxin has a feedback inhibition m 
on the basophilic cells of the pituitary. a = 
latter effect may be mediated by cytoplasm 
factors involving inhibition of the enzym 
system. It is not clear whether the site © 
inhibitory action is directly on the basophi 
cells themselves, or on neurosecretory calle at 
the hypothalamus. But even here, morpholog. 
ical indications at least suggest that nt 
protein synthesis is going on, and it 1s a 
possible that genetic regulatory mechan S'i 
are involved. Similarly, in the thyroid cel oy 
is quite likely that TSH is acting throus 
genetically regulated synthetic processes. for 
A strikingly similar case can be made ic 
many of the hormonal interactions a 
regulated the body economy. Pituitary ical 
stimulates the production of adrenocorti 
steroids, and the cortisol thus produce” g- 
an inhibiting effect on the hypothalam 
pituitary sites of ACTH production. he 
similar interaction takes place betwee? 
gonadotropic hormones and the sex hormo 
between luteinizing hormone and the m ; 
ucts of the developing corpus luteum, a 
There is good reason, therefore, to think el 
hormonal action is in many instances 
rected to the genetic sites of regulatio® Ja- 
hormone synthesis, or at least of the regi in 
tion of the synthesis of enzymes involve us 
hormone production. The hormones ck 
produced from the regulated glands feed Pie 
to repress genetic sites responsible for tly 
synthesis of tropic hormones. The freque” pe 
observed specificity of target glands might 
understood in terms of the specificity of 
active sites involved (Zalokar, 1961), 
The understanding of the mechanisms 
action of hormonal control mechanisms 15 r5 
hanced by the work of Knox and othe, 
(Knox, & Auerbach, 1955; Knox, AuerDa 
& Lin, 1956). They have shown that th® ‘of 
jection of the substrate tryptophan 2°” pe 
cortisone produced a similar increase 1". A 
activity of tryptophan pyrrolase in T% “ess 
similar effect was seen in response tO 2 
but the stress effect was not seen in 24° on 
ectomized rats. The effects of adminisy ive 
of tryptophan and cortisone were ade p 
and the administration of Actinomy i, dii 
eliminated the response to cortisone 


nes, 


the 


of 
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not affect the response to tryptophan. The 
e of tryptophan is apparently one of 
dena induction, similar to what we have 
ari fos. in other words, it works pri- 
dle Yy increasing substrate availability to 
au Pip enzyme, thus preventing 
ever is a adaton, Cortisone activity, how- 
NA fr ependent on the production of new 
is inhibited the DNA template since its action 
unction R ; by the action of actinomycin. The 
ore, to i the corticoids would seem, there- 
synthesi € to promote induction of RNA 
enzyme S. Evidence has been found that other 
exampl Systems operate in this fashion, for 
enolpyre „tyrosine transaminase, phospho- 
sible a carboxykinase. It is entirely pos- 
throu a the induction of enzyme synthesis 
type A „the action of steroid hormones is a 
ighly ation or derepression available to 
a actentiated organisms and unavail- 
acob re differentiated microbial organisms 

Tt jg o Monod, 1961b). 
Tegulati an established fact that hormonal 
involved of body processes is intimately 
from g With regulatory processes stemming 
interest 4 nervous system. Increasingly active 
Neuroend, aoe taken in the study of these 
Nowled Ocrine mechanisms, Although our 
Pothala Se of the relations between the hy- 
temely ES and the pituitary is still ex- 
Primary »Pothetical, it seems likely that the 
rr Site of the production of neurohumoral 
hor or gs regulating the production of tropic 
Nes is in the various nuclei of the hy- 
Carried poe and that the neurosecretions are 
adenoh Y the hyphyseal portal veins to the 
This > PoPhysis (Scharrer & Scharrer, 1963). 
back vaig Suggest that the hormonal feed- 
regulate have something to do with the 
Rites of production of tropic hormones 
Connecti cells. There is already a fairly clear 
Ure of i established between osmotic pres- 
diuretj, , € lood and the production of anti- 
nucle,- LotMOne by the cells of the supraoptic 
indicati of the hypothalamus. As a further 
Neura on of the integral functioning of the 
to be and hormonal systems, one cannot fail 
response Pressed by the sensitivity of the 
Stress ` of the pituitary-adrenal axis to 
Drofoy © possibility that such sensitive and 
the god reactions, which affect and involve 
‘Netioning of the entire organism, may 


be mediated by genetic mechanisms at the 
level of cellular regulation brings the genetic 
component of the adaptive process into much 
closer involvement with the moment-to- 
moment dynamics of the organism's function- 
ing. 

It is entirely speculative, but not altogether 
impossible, to begin to think of neural proc- 
esses in terms of the same type of regulatory 
mechanisms. If it can be established that the 
neural impulse is mediated by chemical 
mediator substances such as adrenaline or 
acetylcholine, it is entirely reasonable to 
assume that the production of such sub- 
stances is controlled through enzyme systems 
of one kind or other. The evidence provided 
by the electron microscope that the nerve cell 
possesses an endoplasmic reticulum and a 
protein synthesizing mechanism (Palay & 
Palade, 1955) and the further evidence 
supplied by Weiss & Cavanaugh (1959) that 
the neuron is continually producing cellular 
materials suggests that the nerve cell might 
also function through the control of protein 
synthesis by genetic mechanisms. The further 
study of enzymes involved in the acetylcholine 
system in the generation and transmission of 
nerve impulses will undoubtedly show the 
integral relationship between the activation 
of such enzyme systems and the function- 
ing of the nerve cell (Nachmansohn, 1961, 
1962). 

Interest has also been developing in the 
biochemical aspects of learning. The relation 
between acetylcholinesterase concentrations 
and previous learning has been approached 
experimentally (Krech, Rosenzweig, & Ben- 
nett, 1960; Rosenzweig, Krech, & Bennett, 
1960; Thomas, 1962) and acetylcholinesterase 
levels have also been found to be related both 
to total quantity of stimulation and to genetic 
differences in maze learning ability in rats 
(Smith, 1962). Such changes in enzyme 
levels in relation to neural cell activation 
are presumably under genetic control. It has 
also become apparent that RNA levels are 
elevated in active cells and Hyden (Hyden & 
Egyhazi, 1962) has even been able to demon- 
strate alterations in the specific nuceotide 
composition of RNA after learning. Tt is not 
impossible that the functional activity of the 
neuron, even in higher cerebral functions 
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like learning, brings about structural modifi- 
cations of the participating neurons which in- 
volve cellular regulatory mechanisms at the 
genetic level. A 

The implication of cellular regulatory 
mechanisms for the understanding of more 
complex forms of behavior must remain for 
the present highly speculative. Yet the few 
pieces of the picture which can be sub- 
stantiated by available evidence suggest that 
the progress of our knowledge is forcing on 
us new ways of thinking about the complexi- 
ties of the response of the organism. The 
study of genetic regulatory mechanisms 
brings the genetic apparatus into much clearer 
focus as an intimately reactive component 
of the total ongoing and dynamic adaptation 
of the organism. The dichotomy between 
genetically determined structure and environ- 
mentally determined function can no longer 
be permitted to dominate the thinking about 
the behavioral aspects of genetics. Structure 
and function can no longer be dissociated 
in our thinking, even at the most basic 
levels of molecular organization. It is ap- 
parent, futhermore, that the most basic levels 
of molecular transcription are intimately con- 
nected with, and reciprocally interact with, 
the more complex levels of organization and 
function which characterize the 
life history, 

From this point of view, it is interesting to 
speculate that the development of our think- 
ing about genetic regulatory mechanisms may 
undercut the traditional arguments about the 
etiology of schizophrenia, Recent reviews of 
front-running biochemical candidates (Kety, 
1959) suggest a wide variety of metabolic 
factors which might be involved in schizo- 
phrenic etiology. The structural similarities 
between mescaline and epinephrine suggests a 
defect in the methylation of epinephrine as a 
possibility, Gottlieb and his group (Beckett, 
Senf, Frohman, Tourney, & Gottlieb, 1962) 
have been able to relate primary symptoms to 
a defect in the moblization after stress of 
fructose-1, 6-diphosphate, an important inter- 
mediate in the metabolism of carbohydrates, 

They believe that this is related to a failure 
in the mechanism for the production of ATP, 
the universal energizing intermediate for 
biological activation. Other abnormalities in 


organism’s 
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the metabolism of the aminoacid tryp toph 
have been established (Price, Brown, a 
Peters, 1959). The list can readily B 
tended, but the significant point is that on 
generally agreed that the schizophrenic oe 
ess involves some sort of metabolic a 
turbance. Moreover, it can be pointed sat 
that such metabolic abnormalities are me s 
ated by alterations in the enzyme Ra 
controlling metabolite production. Suc de 
zyme systems are regulated by genetic hele 
anisms, Consequently, the abnormal meta a 
Processes which accompany the schizop ae 
syndrome may be effected either by apea 
tions in the structural genes or by actl In 
tion or inhibition of regulator genes. az 
the latter instance, it is no longer Pon 
sible to regard genetic factors in isola in- 
from environmental factors which can tion 
fluence genetic regulation by way of induc to 
or repression, It is possible in these hag 
think of a sequence of events deriving 1" ry 
the environmental interaction or life ee 
of a schizophrenic subject and terminati 
in the modification or modulation of for 
genetic apparatus, which would account 
metabolic alterations and give rise to dis- 
Symptom complex characteristic of the ion 
ease. It would seem, then, that the isolat! ar 
and validation of genetic influences in pop" 
tions must be subject to a further refinemer 
namely, that the frequencies of abnormalil! 
be shown to derive from the alteration °- 
structural genes rather than from the activ? 
tion or inhibition of regulator genes. esse 

The mechanisms and specific interactio™ 
involved in the genesis of mental illness 1? 
main to be determined, But progress 
science is frequently purchased at the cost ° 
new refinements in our categories of thinkin 
The understanding of the regulatory functi? 
of the genetic mechanism is perhaps one si 
refinement which can contr. 


] s 
ibute to prog", 
in our thinking about the interaction 


genetic and environmental factors in yta- 
etiology of mental illness and in the adal 
tion of the human organism, 


e 
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LATERALITY OF SUBCORTICAL INVOLVEMENT AND 
PSYCHOLOGICAL FUNCTIONS’ 


MANUEL RIKLAN axb ERIC LEVITA? 


St. Barnabas Hospital, New York 


Differences in psychological effects between lateralized cortical and subcortical 
brain lesions were assessed within a neuropsychological framework. The relative 


predominance of specific and nonspecific sys 


ems in the elaboration of incoming 


data at these cerebral levels and their participation in cognition and perceptual 
events was described. It was concluded that bilateral primacy of subcortical 
diffuse systems in contradistinction to unilateral primacy of cortical specific 
Systems may account for observed differences between lateralized cortical and 


subcortical brain lesions. 


thie tent multidisciplinary symposium on 
Nance rey erig relations and cerebral domi- 
angua (Mountcastle, 1962), monographs on 
an disorders (Osgood & Myron, 1963; 
Swill, 1960), and experimental studies on 
intela a the corpus callosum and other 
Myers mispheric commissures (Sperry, 1961; 
in hemi eb, attest to a renewed interest 
hocal a etie differences and behavior. Since 
Speech f (1861) pronouncements relating 
Pervadi unctions to the left hemisphere, the 
SPhere ng belief has been that this hemi- 
Is ‘dominant,” at least for language 
Mteng; es. In the past several years, more 
h ve laboratory and clinical investigations 
“en undertaken to delineate other facets 
to erp bberic dominance and language, and 
tion Baan Such areas of behavior as percep- 
fun ‘ns Cognition in their relationship to 
differences between the brain hemi- 
w 44 ÎS the purpose of this paper to 
tilah reintegrate existing concepts of 
Teveale he role in humans, particularly as 
“Nilater: 1 Y the effects of surgically induced 
Psychol, „Pasal ganglia or thalamic lesions on 
sical functions. 


activiti 


URRE 
RENT Concepts OF CorTICAL LATERALITY 


I . 
ang p? „Current thinking on both laterality 
€Mispheric dominance continues to em- 


1 
Fro 

Nospitar the Psychological Services of St. Barnabas 

an 1, \¢W York City. Supported in part by 


*iitatign | RD-1274-M from the Vocational Reha- 
2 Admin = cree A 
tor hanks aa tration, Washington, D. C 


re expressed to Irving S. Cooper, direc- 
ent of Neurosurgery, for his encourage- 
continuous support. 


hent eDartm 


phasize a language-nonlanguage bifurcation, 
tends to be clinical in nature, and, in many 
instances, utilizes the convenient Wechsler- 
Bellevue Intelligence Scale (WBIS). In an 
early study, Anderson (1951) administered 
this scale to a group of individuais with 
brain-damaged dominant hemispheres and an 
equivalent group of “nondominant” patients 
and reported a significantly greater loss on 
verbal tests for the dominant hemisphere 
group and a correspondingly greater loss on 
performance tasks for the nondominant pop- 
ulation. A number of confirmatory studies 
have since appeared (Bauer & Becka, 1954; 
Costa & Vaughan, 1962; Heilbrun, 1956; 
McFie, 1960; Morrow & Mark, 1955; Reitan, 
1955; Stark, 1961). 

The mainstream of present-day beliefs con- 
cerning the effect of brain lesions on lan- 
guage is exemplified by Bauer and Wepman 
(1955) and Boone (1959), who agree that 
cerebral dominance seems unique to the left 
hemisphere. However, Heilbrun (1956), in 
assessing the effects of right hemisphere 
lesions, failed to fnd differences in spatial 

erformance. This result is compatible with 
two other studies (Battersby, Bender, Pol- 
lack, & Kahn, 1956; Piercy & Smyth, 1962), 
in which spatial representation was found to 
be bilateral but unequal, with differences in 
degree rather than kind. For this function, 
the evidence is very much against strict uni- 
lateral representation (Osgood & Myron, p. 
53). 

In the realm of cerebral dominance and 
language, a similar changing emphasis is also 
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apparent. Instances of spontaneous recovery 
of language following brain injury indicate 
that there must be some replication of func- 
tion “either between the two hemispheres or 
between regions of the dominant hemisphere 
[Osgood & Myron, 1963, p. 49].” Cerebral 
dominance may in all probability be a graded 
characteristic, according to Zangwill (1960, p. 
27), Eisenson (1962), and Critchley (1962, 
pp. 208-214), the last two of whom stressed 
the importance of the right cerebral hemi- 
sphere in language formulation. Critchley 
conceived of speech as a spectrum ranging 
from the “normal” subject, at one extreme, 
through the linguistic pattern of nondominant 
hemisphere defect to a full-fledged aphasia 
from disease of the dominant half of the brain. 
Chapman and Wolff (1959) extended this 
formulation to the “highest integrative func- 
tions” which they found to be directly related 
to the mass of cerebral tissue lost regardless 
of site and side. This corroborates Lashley’s 
(1929) earlier principle of mass action, and 
Hecaen’s (1959) concept of equipotentiality, 
implying that either hemisphere can assume 
the functions of the other, although at a re- 
duced level. In essence, the potential of the 
“minor” hemisphere may not ordinarily be 
utilized maximally except in the case of a 
lesion to the major hemisphere. Analogous 
experimental work involving corpus callosum 
sectioning (Sperry, 1961; Myers, 1961) has 
demonstrated the independence of such surg- 
ically separated hemispheres with respect to 
learning and memory, since separate memory 
trace systems seem to be developed in both 
hemispheres following learning primarily in- 
volving one hemisphere (Ebner & Myers, 
1962a, 1962b). It should be noted finally 
that a report of Doehring and Reitan (1962) 
illustrates the need for differentiating levels 
of behavior in properly assessing the psycho- 
logical effects of lateralized brain lesions, Re- 
views by Meyer (1961), Reitan (1962), and 
most recently Piercy (1964) also emphasize 
a lack of clear-cut distinctions, despite some 
apparent specialization in lateralized func- 
tions. Contemporary psychological concepts 
of functional laterality in humans have been 
derived predominantly from studies of pa- 
tients with insults on their cerebral cortices, 
including epileptics, hemiplegics, brain-oper- 
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ated or traumatically injured. In Mert 
ported cases, obviously subcortica a ia 
were present concomitant with cortic gato 
age. Not all previously described inves ee 
have taken identical viewpoints, and pr ere 
have been made for a theory of graded agestS 
entials. Recent evidence, moreover, rent 
a possible unwarranted equation of ee 
in degree with differences in kind, co of sub 
with a failure to account for the on ah 
cortical structures in psychological beba 


L 

RTICA 

PSYCHOLOGICAL Errects OF SUBCO 
BRAIN SURGERY 


Psychological investigations W i 
taken with individuals undergoing U” 
subcortical surgery, chemothalamect? o 
chemopallidectomy (Cooper, 1956, 1961 
cryothalamectomy (Cooper, 1964), pinson’s 
relief of tremor and rigidity in Par surgi- 
disease. Such operations involve al 
cal production of a lesion in the mesial 8 
pallidus (in chemopallidectomy) or the wio. 
trolateral nucleus of the thalamus (in © ell 
thalamectomy and cryothalamectomy): jte 
roanatomical studies have confirmed i : 
of lesion placement (Caracalos, Lev! ara- 
Cooper, 1962; Cooper, Bergmann, & o, 
calos, 1963; Dierssen, Bergmann, Gio” ires 
Cooper, 1962). Both of these strut etal 
eventually project largely to the ipsila 
cortex, jen 

In an initial study by Riklan, D ts 
Weiner, and Cooper (1960), using the v on” 
both subcortical hemisphere groups der es, 
strated significant decreases in verbal soo Ke 
while only right-brain operatees showed 4 
decrease in performance tests within t 
weeks following surgery. No significant 
ferences in postoperative intellectual eff? 
were found between lesion sites, A retu” 
Preoperative status for the total group ip? 
observed on follow-up assessment, n i 
months postoperatively. These results iw 
peared compatible with Sheer’s (1956) cot 
vestigation involving surgically produce? prs 
tical lesions in which immediate post? ipg 
tive decline was reversed in follow-up te ot 
Moreover, an association between left, 1a, 
sphere subcortical lesions and deficits 1” nd 
guage seemed to be verified. It was later sol 
(Riklan & Diller, 1961) that immediat 


an 
7 or 
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lowing surgery, patients who had had an op- 
eration on the right basal ganglia regressed 
in visual-motor performance, while the left- 
hemisphere group evidenced no like change. 
However, results of the Human Figure Draw- 
ing Test administered before and after chemo- 
Surgery (Riklan, Zahn, & Diller, 1962) sug- 
gested that body-image perception underwent 
Postoperative modification irrespective of side 
of subcortical surgery. It was noted then that 
both hemispheres may contribute to appro- 
Priate body image integration. This finding 
questioned earlier studies relating body-image 
function primarily to the right hemisphere 
(Critchley, 1953; Kolb, 1959). 

Purther exploration into the nature of pre- 
Sumed lateralized differences and thalamic 
ee Was achieved through a factor analy- 
Bengi, 35 variables from the WBIS, the 

ender-Gestalt Test, Human Figure Drawing 
Test and the Rorschach Psychodiagnostic 
fafa atin & Levita, 1964). Observed 
te ta ized differences were primarily in fac- 

Sadings, with greater verbal loadings for 
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TABLE 1 


SUMMARY OF PRE-POSTOPERATIVE 
CHANGE Factors 


(N = 71) 
Left Right 
brain brain 
variance variance 
Change factors (%) (%) 
Verbally mediated cogni- 
tion 19 9 
Nonverbally mediated 
cognition 13 19 
Visual-motor construction 
expansion 12 12 
Perceptual approach 11 12 
Body image integration 11 12 
Ideational productivity 9 9 
Impulse control 9 — 
Emotional reactivity 6 — 
Perceptual integration = 9 
Factors not identified 10% 18% 


Note.—From Riklan and Levita (1964), 


subcortical laterality (Freed & Pastor, 1951; 


e left-brain operatees and higher nonverbal Gillingham, 1961; Krayenbuehl & Yasargil, 
adings for the right-brain group. However, 1961; Orchinik, 1960; Orchinik, Koch, Wy- 
i=) y . . . 
l © apparent qualitative similarity of psycho- cis, Freed, & Spiegel, 1949), or ascribed more 
Bical change factors, as shown in Table 1, definitive differences in functions to the left 
= to the hypothesis that certain psychologi- and right subcortical structures (McFie, 
ene functions might be subserved by systems 1960). k , 
jyulable to both subcortical regions. These Verbally and nonverbally mediated cogni- 
ndings Opened to further consideration tive tasks were used for further elaboration 
Carlier reports which either failed to weigh and clarification of the nature of subcortical 
TABLE 2 
Varntax Rotation or CENTROID Factors FOR LEFT AND RIGHT SUBCORTICAL INVOLVEMENT 
(V = 24) 
Factor | H? 
Tests 7.9 = A 
iR 1L 2R 2 3R 3L R L 

Raven S matrices 86 66 il 20 —.28 13 | a 49 

i ock design 71 71 19 26 AL —.15 | 55 59 

ppedbulary ‘0918 7874 -02 19 2 oi 

TOverbs 13 1B 69 77 —.21 24 54 67 

Digits eS = 13 58 59 56 

Me tal control 32 —07 43 %2 3859 43 36 

esota form hoard 26.60 38 —.23 a OB oo a2 

ormation 02 —.05 o1 40 4 19 | 16 20 

Cmmon variance 39 39 42 39 19 2 | 10 100 


ote 
Adapted from Levita and Riklan (1965). 
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laterality. No significant behavioral differ- 
ences were obtained as a function of lateral- 
ized involvement, based on groups of pre- 
dominantly right-handed subjects (92% 
‘right-handed). Particularly, no differences 
were found in unoperated parkinsonians with 
predominant left- or right-sided neurological 
symptoms, that is, presumably right- or left- 
sided basal ganglia damage (Levita, Riklan, 
& Cooper, 1964). Moreover, in a partial 
cross validation and extension of Riklan and 
Levita’s (1964) original factorial study, a 
battery of cognitive-perceptual tests was ad- 
ministered to a group of unoperated parkin- 
sonians and also subjected to factor analysis 
(Levita & Riklan, 1965). Two major, almost 
identical factors—specifically a visual-spatial 
and verbal discrimination factor for each 
hemisphere group—were found (shown in 
Table 2). In the case of surgery, no substan- 
tial immediate postoperative differences were 
found between left and right thalamic lesions 
as a function of symbolization, whether ver- 
bally or nonverbally mediated (Levita, Rik- 
lan, & Cooper, 1964). Finally, laterality and 
site of surgery were not pertinent to verbal 
and perceptual changes. 

In the light of the foregoing findings, the 
conclusion seems warranted that laterality of 
predominantly subcortical impairment does 
not result in the kind of differences reported 
for cortical lesions with or without some sub- 
cortical involvement. It follows that reassess- 
ment of lateralized aspects of neural organi- 


zation as it pertains to integrative behavior is 
indicated. 


DISCUSSION 
Physiological Considerations 


Recent electrophysiological studies have 
shown bilateral hemispheric changes follow- 
ing unilateral subcortical surgery. In partic- 
ular, electroencephalographic changes imme- 
diately following such surgery have frequently 
shown bilateral decrease in speed of rhythm 
and increase in voltage (Bravo, 1956; Lin, 
1964). The bilateral influence of the basal 
ganglia was corroborated by pathophysiolog- 
ical studies on the role of ipsilateral lesions in 


hemichorea and hemiballismus (Dierssen, 


MANUEL RIKLAN AND ERIC LEVITA 


Gioino, & Cooper, 1961) and the pie 
Penfield and Roberts (1959, pp. 207-2 l 
that a bilaterally distributed subcortical vi 
is responsible for the integration of ante 
Complementary studies involving ne 
stimulation during unilateral men 
surgery also confirmed a contralateral pos n 
as an ipsilateral component for this ree 
Both Ervin and Mark (1960; N E 
Walter, Rand, Crandall, Markham, and aked 
(1963) observed bilateral, copia a 
responses to stimulation of specific as We n- 
nonspecific thalamic nuclei, It is thus saute 
ceivable that specific systems contri ra 
mainly, though not exclusively, to ipsilater 
cortical responses while nonspecific se h 
mediate primarily bilateral reactions, ae 
the manner of interhemispheric transmiss 
is not clear, the 
While the ventrolateral nucleus of ites 
thalamus and the mesial globus pallidus he 
of the surgical lesion project mainly tO tion 
unilateral cortex, the physiological distUP ing 
itself may be bilateral in nature, involv 
(a) the interaction of the specific lesion Tie 
nonspecific subcortical systems and 
cortex, bilaterally, us 
Laboratory studies have yielded analog 
evidence and stressed the importance of “phe 
Specific systems in neural functions. 2”. 
alerting or arousal of the central nervous © 
tem has been attributed to the brain-5 s4 
reticular activating system (Magoun, 1? H 
1963; Moruzzi & Magoun, 1949), A functio 
ally related System, the nonspecific thalan y 
cortical projections, also modulates sens g 
and motor functions of the cortex, accord”? 
to Morison and Dempsey (1942a, 1942D); 
Dempsey and Morison (1942a, 1942b), a 
Jasper (1949; 1960). It is of special ty. 
vance that the bilateral influence of the &. 
fuse thalamo-cortical system has been © 
firmed also in laboratory ' 
Fessard & Rouguel, 1958: Enomoto, 19°, 41 
This bilaterality is not due to cortico-co™ the 
connections, transfer of impulses throug? jor 
Corpus callosum, or the anterior or posto): 
commissures (Enomoto, 1959; Jasper, 1 
More recently a bilateral, basal ganglia "was 
ence upon ascending nonspecific activity 5). 
reported (Krauthamer & Albe-Fessard, 


m 


4 be 
experiments ( 
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Behavioral Considerations 


Research on cerebral activating systems 
aS) Served to focus interest on the alerting or 
activating aspects of behavior. According to a 
review by Samuels (1959), constructs such as 
attention, Perception, drive, and positive and 
negative reinforcement may possess a com- 
i nonspecific activating component in ad- 
ties Tina their specific sensorimotor proper- 
ing sley (1951, 1956, 1957, 1960) added 
Ta n and learning to the behavioral cate- 
esses served by neural activating proc- 
pa t would therefore appear that a more 
AN description of behavior requires 
ge of activating as well as directional 
1959), a Dully, 1934, 1957, 1962; Malmo, 
failure to gaye culatly striking example of the 
irecti 4 differentiate between activation and 
oth B 3s illustrated by emotion, where 
areas a included in a single concept. Other 
t behavior may similarly be analyzed. 
bereet portance of activation or arousal in 
anima] val functions was demonstrated in 
Stimulates in which moderate electrical 
Ceptual ton of diffuse systems facilitated per- 
E; one and discrimination (Fus- 
Perceptual ae 1958). Moreover, the 
Sensory ic Cognitive impairments seen in 
Lindste €privation have been attributed by 
fectin a (1961) to altered sensory input af- 
Sensory Ne ordinary operation of nonspecific 
. Systems, With respect to cognition, 
Dlay F tenting reflex” has been considered to 
Tan, 1 Preparatory, preadaptive role (Raz- 
epres? Sokolov, 1960, 1963). This reflex 
uli Which: an alerting reaction to novel stim- 
trom abe May constitute “the material basis” 
b. 119) ich cognition emerges (Razran, 1961, 
lent to hen May also be considered equiva- 
ler € behaviorists “operant level” (Kel- 
tg r hoenfield, 1950, p. 76). This energiz- 
Hte ae has been related to action of the 

Dti formation. 
org, tt alerting would seem to facilitate 
Ada pas m’s perceptual as well as cognitive 
Ness ey ity. In this context, perceptual readi- 
Mizin described by Bruner (1957) as mini- 
ts € surprise of an organism and limit- 
“Ung, Need for searching. Moreover, for 
Žorizapg Perception involves an act of cate- 
uch categorization or discrimina- 
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tion embraces both perceptual and cognitive 
functions, as exemplified by responses to 
psychological tests. The authors therefore 
propose that such responses incorporate spe- 
cific and nonspecific alerting in combination. 


A Neuropsychological Synthesis 


It was suggested earlier that subcortical 
impairment does not result in the kind of 
lateralized effects seen in primarily cortical 
lesions, the differences, where found, being in 
degree rather than in kind. For both physio- 
logical and psychological events, specific and 
diffuse processes operate concomitantly. Con- 
comitance of physiological and psychological 
events is not meant to imply an identity or 
semantic confusion between these two sub- 
universes of discourse. 

At the thalamic level, diffuse and specific 
systems interact in the elaboration of infor- 
mation. Diffuse bilateral Processes contribute 
alerting or attention components while con- 
current specific processes contribute more 
definitive information, At this level, the less 
completely organized data may require greater 
diffuse activity to achieve elaboration, Within 
the cortical mantle, when further integration 
has occurred, less diffuse driving may be re- 
quired. At this level, too, lateral differentia- 
tion has occurred within a framework of more 
definitive integration. In short, for sensori- 
motor integration, relative predominance of 
bilateral diffuse systems is assumed at the 
subcortical level, whereas at the cortical level 
more specific unilaterally distributed influ- 
ences may predominate. 

Surgically induced subcortical lesions, af- 
fecting data processing at a predominantly 
diffuse stage, might be expected to disrupt 
higher order behaviors regardless of lesion 
laterality. This proposition has been suffi- 
ciently confirmed by the series of subcortical 
studies previously enumerated. On the other 
hand, results of the cortical studies have gen- 
erally shown a greater degree of lateraliza- 
tion. Such results are also compatible with 
our concept concerning greater differentia- 
tion and lateralization at cortical levels, 

The foregoing analysis has emphasized the 
role of incoming data involving both specific 
and diffuse processes. Yet the nature of cog- 
nition and perception is such that outgoing 
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and modulating events also participate. Cog- 
nitive and perceptual behaviors involve all 
levels of sensorimotor integration, and a 
breakdown or lesion at any level would affect 
the entire system and therefore the ultimate 
response. 
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PHYSIOLOGICAL GRADIENTS AND BEHAVIOR’ 
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Physiological gradients accompanying mental activity have been found in 
skeletal-motor and autonomic recordings, commencing with the onset of the 
behavior sequence and terminating at its conclusion. Experimental evidence is 
Presented indicating that these gradients do not signify increasing activation 
(or arousal) during the behavioral sequence (e.g. task or period of attentive 
listening). On the contrary, the EEG evidence clearly indicates that cortical 
activity remains relatively constant during the sequence when skeletal-motor 
and cardio-respiratory levels show progressive rise. While the gradients there- 
fore appear not to represent increasing motivation, or the like, during the 
task, there is strong evidence indicating that the steepness of the gradients 


is a function of motivational level. 


T ; 
pea now extensive evidence from a 
Physiolori of experimental situations for 
Onset a gradients commencing with the 
Perform, a continuous sequence (such as the 
Stor, -22C of a tracking trial, listening to a 
continuin Participating in an interview) and 
lon of n as progressive rise to the termina- 
Presenteq € sequence (typical gradients are 
Purposes of Figures 1 and 3). One of the 
dence S of this paper is to review this evi- 
these | Another purpose is to consider what 
centra adients may mean with regard to 
ic nervous system (CNS) and auto- 
Wnderlyi ervous system (ANS) mechanisms 
arly sh organized behavioral sequences. 
"eviews by related to this topic have been 
Btadien, Bartoshuk (1955b). Experiments 
With o ients in our laboratory commenced 
Servations on psychiatric patients. 


ys 
TOLOGICAL GRADIENTS IN CLINICAL 
RESEARCH 


first study (Malmo, Shagass, & 
» 1951), electromyographic (EMG) 


hi 
‘ates Penne was supported in part by United 
Sa con ic Health Service grants, a United States 
Bowtical abe and grants from the National and 
arg oe esearch Councils and the Defence Research 


Tn 4 
Davig he 


Scrip t 


gradients were clearly evident in the tracings, 
showing progressively rising EMG voltage 
(i.e. increasing muscle tension) from the 
onset to the termination of a mirror drawing 
test, usually with marked fall in tension at 
the completion of the drawing. These gradi- 
ents were so very striking in the muscle po- 
tentials that (as will be obvious later in this 
paper) for some five years experiments de- 
signed to study physiological gradients in re- 
lation to performance (later with nonpatient 
subjects) concentrated on muscle potentials. 
To anticipate a later section of this paper, 
when a number of other physiological trac- 
ings were included in this kind of study (be- 
ginning with the one by Malmo & Davis in 
1956), most showed gradients, so that in the 
discussion of EMG gradients that follows it 
should be kept in mind that these gradients 
are not peculiar to the skeletal motor system. 

It may also be noted that even prior to 
1956 Malmo and Shagass (1952) had found 
that psychoneurotic subjects showed a blood 
pressure gradient during a mirror drawing 
task (not shown by normal controls), and 
similar findings were also reported in this 
article for blood pressure change during a 
series of paced size discriminations. 

There were also a number of EMG gradi- 
ents that appeared in these clinical studies. 
Davis and Malmo (1951) obtained gradients 
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from the frontalis muscle during interviews 
lasting an hour or more in a patient whose 
chief complaint was severe headaches which 
she referred to as bandlike sensations en- 
circling the head just above the eyes and most 
intense over frontal and temporal regions. In 
one especially stressful interview the tension 
in the forehead reached a particularly high 
level, and the patient actually complained of 
headache. Supportive measures introduced by 
the therapist at this point were followed by 
a sharp drop in forehead tension. 

In a case with severe tensional discomfort 
at the back of the neck, Case 2 in Shagass 
and Malmo (1954), neck EMGs showed a 
gradient, again with complaint of headache 
when the gradient had nearly reached its 
highest level. EMGs from the limbs (com- 
bined) also showed a gradient, but frontalis 
EMGs failed to do so. The agreement of locus 
of the somatic change with the patient’s sub- 
jective report (e.g., frontalis EMG gradient 
in the patient with complaint of discomfort in 
the forehead and neck EMG gradient in the 
patient with complaint of headaches in the 
neck) is evidence for the “principle of symp- 
tom specificity” which has been stated in 
general terms by Malmo and Shagass (1949) 
and which is closely related to the principle 
of autonomic response stereotypy stated by 
Lacey and his co-workers (for a review, see 
Lacey & Lacey, 1958). Other examples of 
cases of this kind are cited in Malmo (1959, 
see Fig. 6, p. 382, and accompanying text 


for description of a symptom-specific EMG 
gradient that appeared during a tracking task 
carried out under stressful conditions). 


GRADIENTS DURING CONTINUOUS LISTENING 
UNDER CONTROLLED EXPERIMENTAL 
CONDITIONS 


In recording physiological reactions from 
psychiatric patients during interview there 
are a number of uncontrolled variables. First 
of all, with psychiatric patients as subjects it 
could have been that the gradients were 
simply another manifestation of psychoneu- 
rotic overresponsiveness (see Malmo, 1957, 
1959; Martin, 1961); in the second place, 
the gradients could have been due in large 
part at least to some kind of cumulative effect 
of very stressful interview content. 
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That such conditions were not essential f 
the appearance of frontalis’ EMG gad i. 
was demonstrated in an experiment by z : d 
lerstein (1954). This investigator emp va 
normal subjects who listened to an inter . 
ing detective story presented by a ek 
corder, and during the period of ei 
(about 10 minutes), frontalis and gin ae 
sion showed progressive rise, falling soit of 
conclusion of the story. In a second a to 
the same experiment the subjects listen Ret- 
an excerpt from Kant’s Critique of Pu’ "EMG 
son, and again during listening frontalis 
gradients were obtained, though they any- 
smaller in magnitude than those goo not 
ing the detective story, and tension di sat 
fall abruptly upon its termination, 35 
the case at the end of the detective age 

No significant increase in forearm "letec 
tension was found in listening to the of the 
tive story, but during the first hearing ise 0 
essay there was significant tensional ra gs 
the forearm. Wallerstein cites similar aD 
in an earlier study by Smith, Malmo rms 
Shagass (1954) in which tension in the aig. 
rose significantly during listening tO 4 pur 
tively uninteresting article which ha! aces 
posely been made difficult to hear in P“ he 
by decreasing the volume control 


jnt 
recording apparatus, at intervals, to ê ost 
where subsequent reproduction was “f the 
inaudible—although it could be hear e 


subject made an effort. In both articles sp- 
significantly increased forearm muscle p° ul 
tials were interpreted as reflecting eff e 8 
listening. “Straining” to hear, in which "° nl 
made of the skeletal muscles, is a comm pis 
observed phenomenon, but to the best © pas 
author’s knowledge no clear formulatio” the 
ever been made concerning what purpos? jn 
muscles can serve in such cir 


c cumstance™ gi- 
dealing later with interpretations of 8" ọn 


ents in general, some light may be throw 
this question. jent 
In his interpretation of EMG grad jes 
from facial (chin and frontalis) mur epis 
Wallerstein suggested that these 814C cing 
may have been “associated with incre pine 
comprehension or organization of iP” pay 
verbal material—organization whic we! A 
take place during attentive listening p 
lerstein, 1954, p. 238].” Tt is impo 
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note that in his interpretation of the chin 
and frontalis muscle gradients, Wallerstein 
poms Hebb’s earlier approach to motiva- 
= (1949, P. 228 ff.) prior to the formula- 
on of his “arousal” concept (Hebb, 1955). 
A gual, experiment by Bartoshuk 
oe ) again employed the same detective 
Malis normal subjects listening. But in 
ies, = to EMGs, frontal and parietal EEGs 
Waller . recorded. Combined results from the 
Steen a en and Bartoshuk experiments 
ta a the demonstration of significant 
ete EMG gradients during listening. 
nes i correlations between gradient steep- 
by B nd reported interest were interpreted 
iter nme as supporting his hypothesis 
i from Wallerstein’s, 1954) that 
indicat. of forehead EMG gradients “were 
Fee ie of motivation to listen.” In the 
quent caring of the story (but not in subse- 
fiche! were of it) the steepness of 
lated t EMG gradients was inversely re- 
tWeen © EEG amplitude. This relation be- 
EMG gradients and EEG on the first 


Cari S 

(loss Was interpreted in terms of Hebb’s 

Brad arousal function,” the steeper the 
ent the higher the arousal. Barto- 


s 
gradid 56, p. 161 f.) explained that it is the 
arousal as a whole which is related to 
the rigi level, and he did not conclude that 
alleleq mg tension during listening was par- 
Y growing interest in the story, 
ECessa or that to test this it would be 
Sections” to divide the story into several 
interest? each of which could be rated for 


Pointing 


sua thermore, Bartoshuk demonstrated with 
Compone EEG data (in which the alpha 
ĉan ERG was most prominent) that the 
2G voltages themselves failed to show 

With © @dients as would have been the case 
yy, Pr ostessively greater cortical activation 

ac € period of listening. 

ERGs of correlation between EMGs and 
as į on repeated presentations of the story 
Ting tetPreted in accordance with the ter- 
arousa of Hebb (1955) as indicating that 
he o function was less a factor in repeated 
ings ng and that the gradients on later hear- 
ofe ight be understood entirely in terms 
tion © function, The nature of this cue func- 
ect was not specified. Bartoshuk sug- 
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gested that further analysis of relations be- 
tween cue and arousal functions might be 
more profitably done with tasks in which 
performance level could be used as a measure 
of cue function and in which the subject’s 
motivational level (“arousal function”) 
could be more easily subjected to experi- 
mental manipulation. 


PHYSIOLOGICAL GRADIENTS IN PERCEPTUAL 
Motor Tasks 


Again, for summary of the early work re- 
lated to this topic see Bartoshuk (1955b). 
Following the work in our laboratory with 
mirror drawing in psychiatric patients there 
was a series of studies with normal subjects 
beginning with Smith’s (1953) experiment. 
Using mirror drawing, Smith obtained EMG 
gradients confirming the findings of Malmo, 
Shagass, and Davis, (1951). Smith’s inter- 
pretation of these gradients in principle seems 
very similar to Wallerstein’s (1954) interpre- 
tation of gradients during attentive listening. 
Smith’s hypothesis was stated as follows: 


the observed increase in muscle tension during 
drawing reflects the developing organization of a 
central neural process, which controls and “antici- 
pates” the overt activity, and is yet relatively inde- 
pendent of immediate sensory stimulation [1953, 


pp. 34-35]. 


Bartoshuk found significant relations be- 
tween steepness of EMG gradients with 
amount of increase in residual tension fol- 
lowing task interruption (1955a) and with 
speed and accuracy of mirror drawing 
(1955b), which he interpreted as supporting 
a motivational hypothesis of EMG gradients. 
Surwillo (1956), employing a visual tracking 
task, showed that by varying incentive he 
d alter the steepness of EMG gradients. 


coul 5 3 
sk that was invariant 


Moreover, by using a ta 
in trial duration he introduced a control that 
was absent in Bartoshuk’s experiments where 
trial duration varied according to drawing 
speed. 

Surwillo also brought some support to 
Bartoshuk’s notions concerning EMG gradi- 
ent steepness being related to level of per- 
formance. Data from subjects performing 
two tracking trials under conditions of very 
high incentive were fractionated in the case 
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of each subject according to whether the 
tracking score was the better or the worse for 
these two trials. EMG gradients for the left 
forearm muscles (i.e., from the arm not en- 
gaged in tracking) turned out as predicted 
with the steeper mean gradient corresponding 
to the better performance. Since this positive 
finding appeared in only one of the five 
muscles studied, Surwillo did not feel that 
the Bartoshuk hypothesis was strongly sub- 
stantiated. However, the difference in incen- 
tive between the two trials which he selected 
for study was probably not very great (this 
kind of comparison will be dealt with later 
when some recent unpublished data are pre- 
sented), and the particular muscles showing 
the most pronounced gradients appear to vary 
from one kind of task or experimental situa- 
tion to another. For example, forehead 
EMGs that showed the best gradients during 
listening have been found inferior to other 
muscles in showing gradients in mirror draw- 
ing (see Bartoshuk, 1955b, p. 219; Malmo 
and Davis, 1956, Table I). 

A very important observation made by 
Surwillo (1956) and later confirmed by Sten- 
nett (1957) was that grip pressure on the 
knob used in tracking actually fell during 
tracking, Therefore, the EMG gradients 
cannot be accounted for merely in terms of 
progressively increasing muscular exertion (in 
the context of force applied to the knob) 
during the task. This finding appears to have 
been overlooked by Eason (1959) and by 
Eason and Branks (1963) in their interpre- 
tation of EMG gradients. Eason (1959) 
failed to obtain EMG gradients under condi- 
tions in which force applied to a dynamome- 
ter declined during the trial. However, his 
failure to obtain gradients was very likely 
due to the simplicity (the low “mental” de- 
mands) of the task (squeezing a hand dyna- 
mometer). Here, Bélanger’s (1957) findings 
appear relevant: Bélanger’s subjects in a task 
requiring a series of rapid visual discrimi- 
nations were required to press a button with 
their index fingers each time a judgment was 
made. Under these conditions EMG gradi- 
ents were obtained. But when all they had 
to do was to press the button at the same 

intervals without being required to make any 
discriminations, no EMG gradients appeared. 
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Eason’s (1959) findings appear similar a 
these control results of Bélanger; and coe 
shuk’s interpretation concerning EMG gr be 
ents and motivation therefore appears to os- 
better supported by the facts than the Rana 
ing interpretation of Eason and Br ro- 
(1963, p. 541), which holds that the pro- 
, ? gnifie 
gressive increase in EMG actually sig 
increasing level of activation. 1Gs 
That gradients were not confined to pen 
but occurred also in other physiological the 
tions seemed a reasonable inference SOR i 
earlier work with psychiatric patients. i 
was confirmed with normal subjects in Davis 
ror-drawing experiment by Malmo and t rate 
(1956). Gradients were found in heat ad 
(HR), blood pressure, and respiration İ 
dition to those in muscle tension. t 
Finally, some unpublished tracking da t 
addition to providing some new informe 
confirm the Malmo and Davis (1956) and 
ings of HR and respiration gradients 
Bartoshuk’s (1956) failure to obtain peri- 
gradients. A full description of the T i 
ment from which these data were take ne 
Presented elsewhere (Malmo, in press). by 
tracking apparatus has been describe gents 
Davis and Mundl (1964). Figure 1 pre gres 
EMG gradients for the left pronator he 
muscle, showing a gradient steeper ani 
best trial in four than for the worst led 
These four trials came from the so-c® 
unified set condition in the experiment een 
Malmo (in press). The difference betW he 
mean left pronator teres EMG levels for ata 
low and high distance-off-target (error) 
was significant (p < .025, one-tailed). 
Figure 2 presents the corresponding 
formance data. Note that in contrast t° 
regular rising gradients in Figure 1, the £ 
curves show fluctuations and that the 
error curve, at least, fails to show any ° 
all gradient. The three peaks in error 5° 
occur close to the 24-second point ™ pe 
tracking cycle which would correspond i ped 
place where the subject, having just Te?" jn 
the stop, was forced to reverse directio er 
rotating the tracking knob (from co" rom 
clockwise to clockwise). It is clea yes 
these data that the regularly rising “er 
in Figure 1 cannot be accounted for et 
of increasing or decreasing mean ett 


a, in 
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LEFT PRONATOR TERES EMG ž 


"i Low enno 


hig canon 


MEAN MICROVOLTS 


kiá 10 is 20 a 


TIME IN MINUTES, 


; Comparison of left pronator teres EMG 

r high- versus low-error trials in manual 

umbers on the baseline represent the 4- 

Sration intervals that were selected for 

nt. Each point is based on the mean of 

: woo from 35 subjects. Data from 7 sub- 
© Unavailable for this analysis.) 


tracking, (N 
Second inte, 
Measureme 
Measur 
ject, 


ir 
diners of any progressive change in task 
Pi Y within trials), 
and sure 3 presents gradients for respiration 
Very rep, The HR curves in particular are 
ittegulary lar and fail to show any trace of 
in niy corresponding to the fluctuations 
© error curves in Figure 1. The differ- 
Error oe mean HR for low- and high- 
(p Nditions was marginally significant 
differen >? one-tailed). The corresponding 
< Pa for respiration was significant 
Fig, >? Me-tailed), 
1730 T° 4 presents curves for EEG in the 
ftom „yles per second (beta range) plotted 
useq p © Same integration intervals as those 
“ures 1 and 2; and the absence 


MIGH ERROR 


MEAN DISTANCE OFF TARGET (DOT) IN DEGREES 


s oe 20 ~ as 


TIME IN MINUTES 

ti tong Mean error curves are plotted from same 

the tegration samples as those used for plot- 

Mean “Urves in Figure 1. (These data are based 

ects distance-off-target (DOT) scores from 40 

thiş „êta from 2 subjects were unavailable 
analysis.) 
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Fic. 3. Gradient curves for HR (N=40) and 
respiration (M =35). (Points on the base line are 
based on means for successive sevenths of the 
tracking trial. Data for 2 subjects were unavailable 
for HR, and data for 7 subjects were unavailable 
for respiration analyses.) 


of a regularly rising gradient is quite evident, 

As previously mentioned, Bartoshuk 
(1956) showed by means of quantified data 
that EEG gradients failed to appear in a 
listening situation in which forehead muscle 
gradients clearly appeared. Bartoshuk’s 
filters were so constructed that the high 
voltage alpha frequencies dominated the pic- 
ture. The data in Figure 4 provide further 
negative data, this time showing absence of 
gradients in EEG data from the beta range. 
Furthermore, the results of quantitative 
analysis of EEG alpha from the present 
study (in which an 8-12-cycles-per-second 
filter was employed) confirmed Bartoshuk’s 
earlier finding with regard to absence of EEG 
alpha gradients. In the high error condition, 
21 of 40 subjects (i.e., only 52.5%) showed 
an increase in voltage from the first to last 
point on the curve. Under the low error 
condition there were 18 of 38 (only 47.37%) 
of the subjects showing such an Increase. 
These percentages are clearly not significant 
from chance (50%). 

There appears, therefore, to be no instance 
of EEG voltage in any of the frequency 
bands studied showing progressively increas- 
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Fic. 4. Curves for mean 17-30 cycles per 
second. (EEG values are plotted in the same way 
as those for the data in Figures 1 and 2. Each point 
is based on data from 35 subjects. Data for 7 
subjects were unavailable for this analysis.) 


ing activation. This should not be particu- 
larly surprising since there is no evidence on 
other grounds for interpreting the gradients 
as signifying gradually rising activation level. 
That steepness of gradient and EEG indices 
of activation are significantly correlated 
under certain circumstances was demon- 
strated by Bartoshuk (1956), who, as pre- 
viously mentioned, found very significant 
inverse relations between mean EEG voltages 
and forehead EMGs during the first hearing 
of a story. This of course means that during 
the first hearing, subjects with steep EMG 
gradients showed the typical, low-voltage, 
“activated” EEG, whereas subjects in whom 
such steep gradients failed to appear showed 
the typical “nonactivated” EEG pattern (see 
Bartoshuk, 1956, Figure 3). Absence of cor- 
relations between EMG and EEG on later 
hearings of the story was interpreted by 
Bartoshuk as indicating that gradient steep- 
ness was multiply determined, that activation 
level was an important factor in determining 
gradient steepness on the first hearing of the 
story but not a significant factor when the 
same story was repeated immediately after- 
wards. In suggesting what one of these other 
factors might be, Bartoshuk singled out the 
cue function as postulated by Hebb (1955). 
EEG data in the study herein reported 
(see Figure 4) did not show a significant 
difference between high- and low-error condi- 
tions. It should be pointed out, however, 
that the mean difference between these two 
conditions was relatively slight. For low 
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error the mean off-target score was .21 de- 
gree compared with .28 degree mean seorg 
for high error, a 334% increase. c 
pared with this, the difference between = : 
and double tracking was much greater: - 

degree mean score for single and .42 E 
for double tracking, an increase of 68% E 
Malmo, in press). Here it should be sieren 
that because of the relatively slight diten 
entiation provided by the high- versus ar 
error comparison, no very large and F Hi 
nificant differences should have been expec jo- 
between conditions for the peripheral P 
logical functions (EMG, HR, and respiratlo 

which showed gradients. in 

The chief importance of this study was jal 
showing that with no significant ie 
trend in performance, and in the press 
of rising EMG gradients, there were no 
gradients. Interesting trends may appe@ wi 
EEG over time as MacNeilage* has sho a 
but these trends in EEG observed by M 
Neilage were not found associated with E) ic 
gradients so that there is really no co” 
between his findings and mine. hat 

If it is steepness of the gradient t Í 
under certain circumstances, gauges le 
activation within a performance trial no 
during listening and the like) and it B oA 
the case that activation increases pari P A 
with the EMG gradients, then there ÍS 7. 
reason really to expect EEG to show a 8" at 
ent. As a matter of fact it appears 
gradients, when observed at all, seem 
appear in measures of “background” effe 
action, and EEGs, of course, do not fit 
this category. 

Of 17 studies, where they were looked uk, 
EMG gradients were found in 16 (Bartost ig 
1955a, 1955b, 1956; Bélanger, 1957; D 
& Malmo, 1951; Elliott, 1964; Malm? po 
press, 1959; Malmo & Davis, 1956; Me i 
& Shagass, 1952: Malmo, Shagass, & 
1951; Malmo & Surwillo, 1960; Smith, 
Stennett, 1957; Surwillo, 1956; Waller et 
1954). Failures to observe EMG g'a% ia 
in adults have been relatively rare- onead 
Neilage (see Footnote 3) recorded foi 
and right pronator teres EMGs in his ther 
and he observed no gradients from 


r in 


to 
ctor 
jnt 


for, 


tei 
5! ts 


c 
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3P. F. MacNeilage. Personal communica! 


PHYSIOLOGICAL GRADIENTS AND BEHAVIOR 


i iing a paced auditory serial-addi- 
t tine - It would be of interest to repeat 
SE gd nea recording from 6 to 8 mus- 
4 i a Soe to determine whether some mus- 
) N than the ones recorded from in 
i nad experiment might show gradi- 
Write =a subjects used their right hands to 
Other pri their serial additions, and from 
1957 Fis les (see, for example, Stennett, 
from’ the left 7 and 3) it seems that EMGs 
ising v eft ( passive’ ) arm would be prom- 

il nes to examine for gradients. 
uine to obtain EMG gradients in young 
typical pr experimental conditions where 
tecently p lents were observed in adults was 
Prom the Ported by Elliott (1964, p. 24). 
“that t ese and other data Elliott concluded 
Old chil P muscle tension response in 6-year- 
tional „C2 does not have the same motiva- 

Significance that it has in adults.” 

is j a of repeated sessions on gradients 
of an ed of further study. From the results 
Malm, oPetiment on sleep deprivation by 

© and Surwillo, it appears that in the 
Sradtents x exceedingly strong incentive, 
l Stee tom certain muscles may decline 
Ma mo Pness during tracking trials (see 
That o © Surwillo, 1960, Figure 6, p. 13). 
Cline Fradient steepness does not simply de- 
task w long continued practice on a motor 
219) “8S shown by Bartoshuk (1955b, p- 
anq & O° mirror drawing, and by Malmo 
of ry willo (1960, p. 19) in their findings 
response specificity during sleep 


deny: 
“Privation 


PaLMar Conpuctance (PC) 


I 

loge pontrast to gradients for other physio- 
Ones Measures, gradients for PC are falling 
igure 1° typical finding is illustrated in 
Strong | Of Stennett’s (1957) article. The 
a task PC reactions occur between trials 
there «> Such as tracking. During the trial 
that S typically a slow return to base line 
Peay duces a falling curve. Generally 
tiong È PC appears very sensitive to transi- 
bein ? that is, to shifts in the kind of activity 
Vitos Performed or to sudden changes in en- 
"ations tal stimulation, For recent corrobo- 
ang H Of this point see Elliott (1964), 

arter, Eason, and White (1964) who 
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cite earlier studies also confirming this gen- 
eral conclusion. 


POSSIBLE NEURAL MECHANISMS UNDERLYING 
PHYSIOLOGICAL GRADIENTS 


At the present stage of knowledge only 
very tentative suggestions can be made in 
attempting to account for the observed phe- 
nomena. As this review has indicated, the 
observations themselves have on the whole 
been consistent and reproducible (surpris- 
ingly so in view of the rather different situa- 
tions in which they have been made). To 
summarize the main findings, progressively 
rising gradients have been observed in re- 
cording skeletal muscle tension (often from 
muscles not engaged in activity) and in 
recording cardiovascular and respiratory 
changes. These gradients have been found to 
accompany periods of sustained attention, 
commencing with the onset of the period and 
rising monotonically to the end, usually with 
a pronounced drop at the end. The time 
courses of the gradients have depended on the 
duration of the behavioral sequences, from 
20-30 seconds in the case of some mirror 
drawing trials, for example, up to 10 minutes 
in the case of hearing a story presented by 
a tape recorder (and, as previously stated, 
even longer gradients, up to an hour or more, 
have been obtained from interviews with 
psychiatric patients). mei 

Again, the evidence indicates that the 
rising gradients do not signify increasing 
motivation, degree of interest, or the like, 
during the course of the behavioral sequence. 
Instead, the indications are that this in- 
creasing “tonic” background is somehow part 
of the whole physiological activity mane 
required for sustaining a relatively even er 
of attention from the beginning to the = 
of the task (or other sequence of menta 
activity). 


At the present stage of neuropsychological 


knowledge, going beyond the statement con- 
tained in the last paragraph is frankly specu- 
lative. However, in order to advance on the 
neuropsychological problem, a certain amount 
of speculation is required (to bring further 
experimental work into focus). i 
The theoretical discussion by Miler, 
Galanter, and Pribram (1960) provides a 
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good point of departure. They said: “A Plan 
is any hierarchical process in the organism 
that can control the order in which a 
sequence of operations is to be performed 
[p. 16].” They went on to say: “The 
execution of a Plan need not result in overt 
action—especially in man, it seems to be 
true that there are Plans for collecting or 
transforming information, as well as Plans 
for guiding actions [p. 17].” There seems 
little doubt that the unit of behavior which 
they described is identical with the unit 
which in our studies is demarcated by onset 
and termination of physiological gradients, 
An interesting parallel between the Miller, 
Galanter, and Pribram analysis and ours 
comes out in their discussion (p. 66) of the 
consequences of task interruption. This dis- 
cussion serves also to bring into the same 
framework the important work of Lewin 
(1935) and his associates. Early in the series 
of experiments on the physiological gradient 
phenomenon Smith (1953) found in a mirror 
drawing task that muscle tension dropped 
significantly more after completion than after 
interruption of the task. This finding fits 
nicely with Lewin’s hypothesis that task in- 
terruption serves to continue or prolong a 
“tension system.” Furthermore, there is an- 
other important area of agreement between 
the viewpoints of Lewin, Miller, Galanter, 
and Pribram and ourselves. In order for a 
tension system or “physiological gradient” to 
manifest itself, the task cannot be too simple 
and repetitious. For physiological gradients 
this was clearly shown in the aforementioned 
experiment by Bélanger (1957). On this 
main conclusion there is agreement between 
the three points of view, though it should be 
pointed out that there is not complete agree- 
ment between prediction from Lewinian 
theory and from the theory of Miller, 
Galanter, and Pribram (1960, p. 66). 
Again, on a frankly speculative basis (but 

drawing on relevant facts where they exist), 
we may raise questions concerning what 
neurophysiological mechanisms could be re- 
flected in the physiological gradients that we 
have been observing. As a starting point we 
may note the conclusion of Miller, Galanter, 
and Pribram (1960) that: 
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Neurobehavioral studies conducted on animals =e 
port . . . [the] notion that the limbic systems a 
the internal core of the forebrain play an essentia! 
role in the execution of Plans [p. 204]. 


They went on to make further, more detailed 
suggestions (admittedly speculative) : 


on 
The hippocampus could perform . . - [a] Boe 
function via the reticular formation in the — 
core of the brain stem, which, in turn, 15 fae ys 
to influence the receptors, the afferent pot 
into the central nervous system, and the ge 
of the entire external portion of the forebrain. 

the hippocampus may be intimately involved in she 
business of keeping the brain at work On 07]. 
successive steps in the Plan... [pp- 206-2 


To bring the discussion back to the Pa 
on physiological gradients, the EEG eviden 
indicates that cortical activity is relatives 
constant while cardio-respiratory and E’ sk 
activity is progressively rising during the g 
or other behavioral sequence. . that 

Probably the most likely notion iS coe 
adaptation or fatigue is responsible for t 
differential reaction, at least in part.” ical 
not unreasonable to assume that caran 
neurons and circuits are more susceptible 
adaptation than the more peripheral parts ce 
the motor system (for a related inferen: 
see Hebb, 1949, p. 226). Then, if there A 
a homeostatic feedback mechanism that kee 
activation level relatively constant during a, 
Sequence where sustained attention $ |. 
quired, this will involve gradually risi”& er- 
tivity of part of the arousal system to ° ” 
come the reduced sensitivity of the “pigh? 
circuits. The affluxive part of this mechan gr 
would presumably be the ascending retic! g 
activating system (ARAS). From the phy 
logical (gradient) evidence, it would eit 
that the descending branch of the reti” jp 
system is also brought into play, Bear” of 
mind the probably greater sensitivit pal 
cortical circuits, if the same arousal F tbe 
is fed via ARAS and also downward YÍ? js 
descending reticular facilitatory mec 7 i 
in the brain stem, this signal (while ™ ype 
taining a relatively constant condition }” ing 
cortex) will produce a gradually incre ay 
peripheral tonus because of the lesse" nis 
*Thanks are due P. M, Milner for suggesti® 
idea to me. 
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In Sensitivity to bombardment in the more 
Peripheral parts of the motor systems. 
Because EMG gradients have been so fre- 
ie found in passive muscles (see Figure 
: example), the question arises whether 
a ee fatigue concept might be pro- 
al e which gradients would be produced 
es oe muscles, However, quite 
cin th MG gradients have been recorded 
example, Poove arm in motor tasks (see, for 
» Bartoshuk, 1955a, 1955b; Surwillo, 
» ruling out any general application of 
an explanation, 


Regarding details of the more peripheral 


Such 


sma te, Bartoshuk (1955b, 1964) has 
spinal Maas Probable importance of supra- 
circuits ‘mulation of muscle-spindle feedback 
Supporting accounting for EMG gradients. 
sive pie this idea is the finding of exten- 
ties een of sensitive foci in the 
Muscle « formation capable of influencing 
Puii Spindles in the cat’s hindlimb (Eldred 
Hori, 1958). Furthermore, with regard 

` Petipherst oe advanced earlier concerning the 
atig al part of the system being less 


al . 
Mue it has long been known that the 


Fulton 
Whil 
ealt 
Contr 


pindles are relatively nonadapting 

» 1949, p. 16). 

© up to this point the discussion has 

Specifically with effects on circuits for 

argum, of skeletal-motor activity, the main 
ents introduced appear applicable to 


e : 
isus nomic effector mechanisms (for a 
Sion related to this problem see Malmo, 


2 P: 22 ff.). 
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EFFECTS OF CHLORPROMAZINE ON LEARNING 


LAWRENCE C. HARTLAGE? 


Department of Mental Health, Louisville, Kentucky 


Learning is an essential part of behavior therapies and is an important require- 
ment of more traditional psychotherapy. The widespread use of tranquilizing 
drugs in psychiatric settings prompts a review of their effects on learning. 
Chlorpromazine is perhaps the most widely investigated tranquilizing drug 
and is used as a referent in the assessment of its effect on learning. Results 
of studies involving a number of animals, normal subjects, and psychiatric 
patients tend to show significant declines in learning on a wide range of tasks, 
with a linear decline in learning with increased dosage levels. A number of 
methodological difficulties which limit generalizations are discussed. 


“ee Widespread use of tranquilizing drugs 
1 Psychiatric settings (Baron & Fisher, 
makes relevant an investigation of the 
Of these agents on the learning of the 
Pact ae being treated. In light of the im- 
of ps the conditioning therapies on the field 
1964 Ychotherapy (Wolpe, Salter, & Reyna, 
carnin and the inevitability of the need for 
More ng of some sort to take place during 
(Mun traditional forms of psychotherapy 
indiy © 1955), the conditionability of the 
idual treated by any psychotropic agent 
of 4 5 an important factor in the outcome 
Englign Po The first published report in 
Chothe On the effectiveness of the psy- 
ing de Peutie effectiveness of a tranquiliz- 
Denier” involved chlorpromazine (Delay, 
et, & Harl, 1952), and by 1956 nearly 
articles on this drug had been pub- 
Bennett, 1957), Since that time, 
4,200 articles dealing with chlor- 
mine have appeared in the Psycho- 
aa blogy Abstracts, making this drug 
t ics the most investigated single psycho- 
“i agent (Freyhan & Merkel, 1961). As 
becom (1962, p. 300) points out, it has 
are oC à Standard against which all others 
Ompared, 


ffect 
Indiyj 


ary 


Metnopotocrear CONSIDERATIONS 


in ot importance of methodological issues 
mph, field of psychopharmacology has been 

el asized as essential to the future of the 
Tngiow 


na at Larue Carter Hospital, Indianapolis, 


ay y 
à Pret to L. B. Miller and R. P. Smith for 
Mary reading of the manuscript. 


discipline (Cole & Gerard, 1959), and meth- 
odological issues have figured prominently 
in recent publications in this field (Dews & 
Morse, 1961; Hunt, 1960; Uhr & Miller, 
1960). Unfortunately, this has not always 
been the case: when Bennett (1957) re- 
viewed nearly 1,000 published articles deal- 
ing with psychopharmacology, he concluded 
that only 10 had employed proper controls. 
In light of inconclusive evidence concerning 
locus of action, global neurochemical effects, 
and specificity of function, perhaps the most 
fruitful approach to the problem at this time 
may lie in the investigation of specific behav- 
joral indices of learning related to adminis- 
tration of the drug. 

As Russell (1964) has pointed out, it is 
quite possible to study empirical relations be- 
tween input, that is, chemical stimulation via 
drug action, and output, that is, changes in 
behavior, disregarding the intervening events. 
This stimulus-response type of operational 
relationship permits generalizations about the 
effect of a given drug on a specified behavior 
without the necessity of postulating any func- 
tional interactions of the biological systems 
involved. Within such an operational a 
work, it is possible to regard a drug as t : 
independent variable and some measure 0 
learning as the dependent variable. 

Psychopharmacologists have typically recog- 
nized that the temporal order or sequential 
effects of drugs might influence their results, 
and so have often alternated or balanced the 
orders of presentations with a “crossover” 
design (Barmack, 1940). This design is a 
special case of the “Latin square” design and 
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requires that all interactions be zero, a re- 
quirement which, as McNemar (1961) points 
out, seriously restricts its use in psycho- 
pharmacology research. In the Latin square, 
therefore, all interactions are omitted from 
the analysis, and the effects of these inter- 
actions become indistinguishable from the 
variations of the response being studied and 
so become “confounded” with experimental 
error (Nash, 1960, p. 133). The requirement 
that all interactions be zero is probably 
seldom met, since, as Kornetsky, Humphries, 
and Evarts (1957) demonstrated, there is a 
typical Drug X-subject interaction due to 
personality differences in reactivity. Further, 
drug-induced changes in learning capacity 
may readily be confounded with drug effects 
within this paradigm, For example, the drug 
may impair the capacity of subjects who are 
first tested in the drugged state to understand 
the task set before them (Jellinek & McFar- 
land, 1940). 

Another major methodological problem is 
that of a suggestion factor, As Rosenthal and 
Frank (1956) point out, this is often 
sufficient to affect the Outcome, and their 
findings have repeatedly been supported 
(Beecher, Keats, Mosteller, & Lasagna, 
1953; Jellinek, 1946; Lasagna, Mosteller, 
von Felsinger, & Beecher, 1954). Although 


the “double blind” model provides a control 


for this, there are typically enough cues from 
side effects to transi 


cend such controls (Baker 
& Thorpe, 1957). 


The problem of res 
haps related to that 
Fleischman (1953) 
Hempel (1954, 1955) 


ponse sets is one per- 

of suggestibility. As 
and Fleischman and 
have pointed out, these 
factors underlying test performance may 
shift during the course of testing, making any 
findings equivocal and inconclusive, None of 
the measures of learning in the field of psy- 
chopharmacology appear to have exerted any 
controls over this problem, so the extent 


to which findings may be confounded by 
response sets is moot, 


RESULTS 
Although superficially it would appear that 
there is a great deal of discrepancy among 


reported results of the effects of chlor- 


promazine on learning, a close examination 
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of these reports typically indicates war m 
apparent discrepancies are due to lac a3 
specification or control of one of the parat 
ters involved. For example, some E 
gators (Kovitz, Carter, & Addison, re 
Shaten, Rockmore, & Funk, 1956) stu A 
the effects of chlorpromazine in poe 
patients have generally concluded that ¢ ig- 
promazine either improves, or does not a 
nificantly impair performance. Kome ET 
(Kornetsky, Humphries, & Evarts, ce 
contrary results with similar tests of thet 
formance in normals were, after ma 
investigation, found to represent per? 
in dosage levels, chronic versus acute a veen 
istration of the drug, and variations UE ets 
various types of subtests as well as onic 
ences between normals and schizop ae y, 
(Kornetsky & Humphries, 1958; Kornets 
Pettit, Wynne, & Evarts, 1959). nses 
Animal Conditioning. Conditioned repor e 
have been found to be impaired in @ fait 
variety of infrahuman species, ranging sage 
pigeons to monkeys, under various ape 
levels of chlorpromazine. Cook, Kelleher, jlo- 
Fellows (1962), using dosages of 5-30 rah 
grams per kilogram of body weight (mg/K8 
found depressant effects on key presar a); 
pigeons. Ferster, Appel, and Hiss (e 
with dosages of 1-10 mg/kg in pie? ner 
found a decrease in response rate at ma dle 
dosages but an increase in lower and M! t 
dosages. Rahmann (1962) demonstrated t 0 
with hamsters dosages of .5, 1.0, and tly 
mg/kg impaired conditionability significa? - 
more than a placebo administered to a © ri- 
trol group; and even after learning tO r 
terion, there was much less resistanc® f 
extinction in spite of the greater numbe! 
reinforced trials. Isolated studies with © 


ts 
(Pautler & Clark, 1961), dogs, and rabbit 


(Sadowski & Longo, 1962) produced sit es 
decrements in conditioned responses. Dee) 
ranged from 1.9 mg/kg (Paulter & ee 
to 225 mg. Working with mice, Jacob vant 
Blozovski (1961) identified another rele. 0, 
variable. Using dosages of 1.6, 2.5, 2-5 ip 
and 3.5 mg/kg, they found significant ing 
pairment of time required for condition ity 
and with dose levels of 10 mg/kg, due fur- 
of conditioning was also decreased. The? oye 
ther found that conditioned response’ 


ts 
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deleteriously affected 24 hours after dosing, 
even when they had not been affected at the 
= of administration of the drug. Rosen- 
BI oh Savaris (1963) found decrements in 
a ee responses one hour after dosing 
am ‘mg/kg, and Clark (1962), using 
sei eee from 16 to 45 mg/kg, found 
i impairment in conditioned motor re- 
osa, T This may be due to the rather high 
L e used, however, since White and 
alert eke (1961) found a change in EEG 
high Se to deactivation patterns with 
RG Oncentrations of the drug, and similar 
aa | ae were reported by Everett (1961) 
celli (19 ard, Bert, Zwingelstein, and Giudi- 
a moto 61). Further, Posluns (1962) found 
May h r deficit at higher dosage levels which 
ing me accounted for the lack of respond- 
repo n Wolf, Swinyard, and Clark (1962) 
ted Similar findings. 
body is be expected, perhaps the largest 
Specie: ol reported research on subhuman 
Frank has been done with rats. Gatti and 
incre (1961) found that a 2-mg/kg dose 
ased the latency of a conditioned reflex 
minutes after administration, abolished it 
it tee after administration, and returned 
Irwin ĉar normal after approximately 6 hours. 
ity, 22d Armstrong (1961) found that con- 
signin a locomotor activity on a treadmill was 
a cantly depressed by chlorpromazine 
ome wed an increasing depression over 
teporteg  ,êdministration. Finocchio (1963) 
conditi Similar results with a lever-pressing 
tion o ned response, The dimension of dura- 
(1965, Pset was demonstrated by Izquierido 
Deritona using 2.0-2.5 mg/kg injected intra- 
OF er (ip). He found an inhibition 
Ute. tioned responding within 10-20 min- 
Pressig oren (1961) found a consistent de- 
Mg/k A m conditioned responses with 1-3 
befo o Mjected intraperitoneally 10 minutes 
(l9 e the initiation of the session. Heimstra 
Suppr, » Using oral administration, found no 
ours Geom in conditioned response after 2 
© me but Gonzales and Shepp (1962) found 
testin 8 (ip) 2 hours and 4 hours before 
tioned had completely supressed the condi- 
to responses. The rate of onset appears 
toy, Substantially faster for ip administra- 
foung Turphree, Monroe, and Seager (1962) 
a significantly lower survival rate 


among infant rats whose mothers had been 
given 5 mg/kg of chlorpromazine regularly 
during pregnancy. It is difficult to say 
whether this was due to a physiological 
factor or a loss of adaptive behavior, although 
human infants whose mothers had been on 
comparable dosages during pregnancy showed 
no conclusive evidence of physiological 
changes (Kris, 1962; Werboff & Kesner, 
1963). Bindra and Mendelson (1963) found 
differential effects of chlorpromazine on 
higher and lower levels of conditioning. He 
reported a decrease in response rate at higher 
levels of training, but not at the highest 
level. Aaronson (1963) has criticized the 
commonly used pass-fail criteria, on the 
ground that it would be difficult to demon- 
strate changes at the extremes of response 
measurements; Bindra’s results may be some- 
what affected at the highest level by this 
criterion artifact. Plotnikoff (1961) found 
that inbreeding increased resistance to the 
effects of chlorpromazine on conditioning 
measures, and Aceto, Lynch, and Thomas 
(1961) found that fasted rats produced 
greater reactivity to the drug, making studies 
which do not control for these variables 
open to question concerning the effect these 
factors might have exerted on outcomes. 
Schieckel (1962) found a curvilinear rela- 
tionship between dosage levels and condi- 
tioned delay behavior in monkeys. Using a 
range of from .31 to 1.0 mg/kg, he found 
large doses produced a significant decrease 
in the limit of delay, whereas small and 
medium doses resulted in increases in limit 
of delay. Weitzman and Ross (1962) found 
that .3 mg/kg produced marked fluctuations 
in responses, and Clark, Jackson, and Brady 
(1962) reported a general decline in condi- 
tioned behavior after administration of chlor- 
ne. P 
a lass of conditioned response. of special 
interest to psychopharmacologists is the con- 
ditioned emotional response. The conditioned 
emotional response has certain face validity 
as an animal analog of human psychiatric 
disorder (Hunt, 1960). Through learning, a 
neutral stimulus acquires the power to evoke 
an intense emotional disturbance, one greater 
than that which the reinforcing shocks could 
evoke initially (Estes & Skinner, 1941). The 
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ease of establishing the conditioned emotional 
response (CER) and of controlling its 
strength experimentally, together with an 
ostensible relevance to human emotional dis- 
orders, has favored its use as an indicator 
in a number of studies of conditioning and 
of the effects of tranquilizing drugs on 
nditioning. 

a (1986) and Jernberg (1957) have 
found that heavy doses of chlorpromazine 
have marked effect on both expression and 
extinction of the CER, but interfere only 
slightly with acquisition. Halasz (1959) how- 
ever, found that moderate doses facilitate 
the extinction of avoidance and conditioned 
fear in the cat. Appel (1963) found that the 
conditioned emotional response in rats was 
inhibited with a dosage of 1.5 mg/kg, and 
Kinnard, Aceto, and Buckley (1962) found 
that dosages of 1.5-3.0 mg/kg produced the 
attenuation of the CER, thereby perhaps 
providing some clue as to the widespread 
acceptance of the drug as effective in psychi- 
atric settings (Gallant, Bishop, Steele, & 
Noblin, 1963; Hunter & Stephenson, 1962; 
Klein & Fink, 1963; Mann, 1963). 

Heise and Boff (1962) have demonstrated 
that the conditioned avoidance model is 
superior for sensitivity to treatment effects 
over a number of other approaches (Holliday 
& Dille, 1958; Jindal, 1961; Randall, Heise, 
Schallek, Bagdon, Banziger, Boris, Moe, & 
Abrams, 1962; Weaver & Miya, 1961). 
Barry and Miller (1962), in an approach- 
avoidance study, found that 2-4 mg/kg re- 
duced approach-avoidance speed in rats, and 


similar conclusions were reache 


d by Harvey 
(1962). Barry (1962), in a well-controlled 


study, tended to corroborate these findings. 
Posluns (1962) found that 3 mg/kg produced 
a delay in acquisition of conditioned avoid- 
ance responses, as did Geller, Kulak, and 
Seifter (1962) with dosage levels of 25}, 25 
1.0, and 3.0 mg/kg. Irwin (1962) found 
that the rate of tolerance in avoidance 
training was influenced by three variables: 
(a) females developed tolerance faster than 
males, (b) the intensity of the conditioned 
stimulus, (c) the intensity of the uncondi- 
tioned stimulus. Experimenters have not 
typically equated these factors in avoidance 
studies; therefore, direct comparisons are not 
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likely to permit any generalizations ee 
Sekino, & Greifenstein, 1962; Ekstra) z, 
Sandberg, 1962; Feldman & Lewis, ohai 
Geller, Kulak, & Seifter, 1962; Jacon, 
Blozovski, 1961; Randall et al., 
Steward, 1962; Wolf et al., 1962). A 
One major problem in psychopharm on an 
concerns the effect of a given Se be- 
organism’s discrimination of its = m this 
havior, or the sensory feedback A Keller, 
behavior (Berryman, Wagman, & plishing 
1960). These investigators, after ~~" rats, 
a stability criterion of performance pefore 
injected them with .2 mg/kg ip 1 POT damin 
the experimental session. The initia ticeable 
istration of the drug produced a no with 
deterioration of discriminative ane pro- 
subsequent administration the next nS first. 
ducing a much greater effect than mA that 
This study led the authors to conclu wou! 
chlorpromazine at a .2-mg/kg level jmina 
initiate a considerable change in ee Jine. 
tion against a previously established bas ave 
The tendency for chlorpromazine tO same 
successively greater effects for the ted bY 
dosage if injected daily was substantia s 7 
Olds and Travis (1960), using levels | 
mg/kg with rats. With human sU jou 
Mirsky, Primac, and Bates (1959) uced 
that 200 mg. of chlorpromazine prO uous 
significant impairment on the Conti" g 
Performance Test of discriminant reS scio 
ing, and Shurtleff, Mostovsky, and DiM 200 
(1962), with dosages of 25, 50, 100, am aina 
mg., found significantly impaired discri” 
tion of tones at all frequencies. per 
Learning in Humans—Normals, A 2“. + 
of studies have been conducted which 
some light on the general effects of © 
promazine on various measures of learn med 
both normal subjects and institution id 
mental patients. Studies on normals ots 
generally impaired performance in ¢? ipê 
ranging from the effects of chlorpro™ 4a, 
on a single bullfighter in the ring (Mort ef- 
1961) to a multivariate evaluation of it ect? 
fects on psychomotor and cognitive tskY? 
in a large number of subjects (Korne isk? 
Humphries, & Evarts, 1957), The Korn eae" 
Humphries, and Evarts study found Masing 
ing impairment on these tasks with inc" nd 
amounts of chlorpromazine. Kornets 


Jor" 
in 
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Humphries (1958) found that a single dose 
Produced a greater decrement on psycho- 
poton, performance in normals than in schizo- 
petites Schneider and Costiloe (1957) 
e it increased reaction time, and Bur- 
a (see Uhr & Miller, 1960) found it 
ee a decrement in key tapping and 
Simila the critical flicker fusion frequency. 
Pollack decrements | in CFF were found by 
Fink (] Karp, Krauthamer, Klein, and 
(1961) and Karp and Pollack (1963). 
ie, Balanov, and Kaufman (1958) 
activit indications of impaired conditioned 
and nd under chlorpromazine, and Schneider 
Rae (1957) found that it inhibited 
a > loning as well. Latz (1963), found 
loner dosage levels produced significant 
ig ie ae in sustained attention, and that 
Perform, Oses resulted in significantly impaired 
Psycho ance on attention, cognition, _and 
(1961) fen tasks. Although Ludvigson 
conditi found no significant differences in 
A Saai eyelid responses between chlor- 
ist ine (25 mg.) and placebo groups, he 
estin, stered the drug orally 1 hour before 
in Wo and there is reason to suspect that, 
si ‘ge at least, a substantially Jonger 
(Benner eet for results to be noticeable 
agland. & Hamilton, 1963; Clark, Ray, & 
enea Ws? 1963b). A study showing no differ- 
age en chlorpromazine and placebo on 
by Wen times and tapping rate was reported 
tdon zel and Rutledge (1962). Mirsky and 
On perfi (1962) found a marked impairment 
eir i mante, and were able to validate 
the tesults (Mirsky et al, 1959). Using 


u 

Snel] dge (1962); DiMascio, Havens, and 
foung 1961), in a well-controlled study, 
Mental significant psychomotor inhibition, 


Wett Confusion, and sedation. Evans and 
Teactio (1962 ) found it increased disjunctive 
(1959)" time, and Schaacter and Wheeler 
(1969) found it inhibited humor. Uhr 
Studies concluded that the majority of 
(eg involving the stronger tranquilizers 
ordina lorPromazine) impaired speeded, co- 
eptual 2 Psychomotor, and simple per- 
atema, Skills, Loomis and West (1958) 

r a to assess the effects of chlor- 
Con azine on a common activity and chose 


tioned driving performance as their de- 


239 


pendent variable. Using a simulated traffic 
condition, they found significant impairment 
in driving. 

Psychiatric Patients. Results of research 
with chlorpromazine on hospitalized psychi- 
atric patients show, for the most part, gen- 
eralized decrements in a wide variety of 
learned behavior, and the few that show 
improvement typically do so at the expense 
of proper experimental controls. Ray, Rag- 
land, and Clark (1962) gave Digit Symbol 
and Purdue Pegboard tests to patients during 
and after chlorpromazine treatment, and, 
finding no differences between these scores, 
concluded that the patients learned certain 
skills on drugs which were not lost when the 
drugs dissipated. This conclusion is not en- 
tirely tenable, however, since they ignored 
the base rate of predrug functioning level. In 
a later, well-controlled study (Clark, Ray, & 
Ragland, 1963a), they found stable signifi- 
cant differences, with earlier impairment on 
intellectual and psychomotor functions, but 
general effects requiring 8-16 weeks to be- 
come statistically significant. The long period 
of time that they felt was required to ob- 
serve noticeable changes was also empha- 
ized by Bennett and Hamilton (1963). A 
number of investigators have used the 
Wechsler-Bellevue scales to measure the ef- 
fect of chlorpromazine on various subtests. 
Nickols (1962) reported a number of studies 
which indicated that placebo groups produced 
higher gains, especially on timed subtests, 
than chlorpromazine groups. Presumably, the 
gains occurred as practice effects due to 
familiarity with the (alternate form) test, 
and so might be considered a fair index of 
learning. Vestre (1961) found that chlor- 
promazine lowered both learning and reten- 
tion, and Helper, Wilcott, & Garfield (1963) 
found similar results with emotionally dis- 
turbed children. Mitchell and Zax (1958) 
found that conditioning was impaired with 
chlorpromazine, and Saucer (1959) found 
that it raised the threshold for perception 
of apparent motion. Similar impairment of 
psychomotor and conditioning performance 
was reported by Kornetsky et al. (1959), 
and Porteus (1957a, 1957b) reported a de- 
cline in performance on maze tests. 

Perhaps the ultimate criterion for the ef- 
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fect of drugs on learning capacity in psychi- 
atric patients would be the effect of such 
agents on the adaptive capacity of medicated 
individuals in societal roles. Ellsworth and 
Clayton (1960) studied 200 successive psy- 
chotic patients admitted before drug therapy 
and 200 admitted after drug therapy. When 
matched for variables considered relevant, 
there were no statistically significant differ- 
ences in either the length of hospitalization 
or the return rate of the two groups. 
Although the drug-treated patients left the 
hospital slightly sooner than those treated 
in the predrug area, they tended to return 
more rapidly than the predrug patients 
(31.3% of drug patients and 25.3% of pre- 
drug patients returned within one year). A 
year later Tuteur, Stiller, & Glotzer (1961) 
found that socioeconomic and other compli- 
cating factors interfered with the reintegra- 
tion of the discharged patients being treated 
with chlorpromazine into society, 

Aware of the problems involved in adjust- 
ment under drugs, Mason, Forrest, Forrest, 
and Butler (1963) emphasized the need for 
dosage adjustment and attention to psycho- 
social factors to help the patient meet the 
environmental stress of community living. 


DISCUSSION 


It is in the higher nervous system that 

human beings differ most from all other 
species of animal, so great caution must be 
stressed on making facile generalizations 
from infrahuman organisms to man. Many 
drug effects, such as those on individual dif- 
ferences, interpersonal relationships, and ef- 
fects modifiable by experience, are essentially 
human variables which are difficult or impos- 
sible to isolate in lower species (Uhr & 
Miller, 1960). This may also be true, to a 
lesser extent, of the use of hospitalized psy- 
chiatric patients for behavioral testing. As 
Reitan (1960) has pointed out, there seems 
to be a large body of opinion in the field 
that normal subjects are better for such test- 
ing than patients, largely due to differential 
base-line considerations. 

A statistical problem arises in the pooled 
testing of a number of variables in a single 
battery. With 20 such variables, chance 
factors alone should make one variable sig- 
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nificant (.05) assuming that all een ae 
independent. Such a “significant i o 
would have to be considered the eE 
chance, unless a replication were done i 
that single variable. This one y 
graphically demonstrated in the r hr 
Kelly, Miller, Marquis, Gerard, and op 
(1958a, 1958b) in an extensive investig o 
of the effects of chronic ape 
drugs on 51 behavioral measures with 
after each administration. They ee o 
tests for drug effects, plus 2,400 m 
significance of correlations between pers ine 
ity variables and drug effects and ppe as 
a number of significant findings. Bu have 
these were not clearly more than might e 
been expected by chance, it was conc pa 
that no dependable drug effects had 
found. 5 Ss 
With a patient population there 1s 4 adh 
tion concerning whether or not the Short 
viduals actually took their medication. rfect 
of parenteral injection of the drug, Pe ake. 
control is a questionable assumption to ee 
To sustain a constant level of the wanes 
effect of a drug so that a battery of use 
can be given, an investigator may either al- 
a repeated injection method or spansules, g- 
though neither adequately insure the aot 
nance of a given drug level. As NeW tra 
(1941) pointed out, maintaining conce” the 
tion of a drug at a uniform level i? ion 
blood, or even at the sites of drug act of 
does not in itself insure that the effect n 
the drug will be maintained at a un! “a 
level, Further, the possibility of short 
adaptation of the central nervous SYS", 
should be kept in mind when subjects et; 
examined for longer periods of time (Beee 
1952). ties 
A number of rather pervasive difficult 


oa ing 
make any definitive statements concer es 
the out at 


w 
any 


g administration, and of ups 
some have used base lines of atypical ee ot? 
There is rarely any control for such fä re 
as institutionalization, When normals ost 
compared with patients there is an i that 
universal tendency to ignore the fac 


f 
b 
U 
f 
153 
f 
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Most patients are hospitalized whereas nor- 
mals (by criterion) are usually not. It is 
Perhaps a truism that few investigators in 
ide take into account the assumptions 
ias various statistical tests. It is not 
Be eo to see 20 subjects put into cells of 
and ion » With chi squares computed between 
Fees Pig all cells, for example, with a dis- 
e effe a both the requirements for x? and 
iha E | such multiple testing on the 
ember, T A typical study was that of 
(1962) wh eller, Rajotte, and Kaufman 
tu who used counterbalanced orders of 
'8S and a placebo on a total sample 

8 patients, 
ae ee assumption is that all “normals” 
w pn: without the substantiation of this 
a xa lly, Tt is extremely unusual for 
Subjects hen to ascertain that his normal 
Compari ave no brain damage, even when 
ween © the effects of drugs on learning 
“Volunte normals and organics. Further, the 
Bien Subject, on whom many psycho- 
jec ological studies with “normal” sub- 
atypical on has been found to be rather 
1948. ie a number of variables (Bellak, 
> Hollister, Traub, & Beckman, 1956; 
19 Saa, 1955; Lindemann & Malamud, 


` 


) von Felsinger, Lasagna, & Beecher, 


Dertosa ugh the effects of anxiety on learning 
there a ance are known (Spence, 1953), 
able 0 PPears to be little control for this vari- 
8 ting Pa than in studies specifically investi- 
Tayl r (Holliday & Dille, 1958; Spence & 
conditi Sl, 1953). The improvements in 
Bators e oiif reported by some investi- 
i Part Sing lower dosage levels may be due 
paired” the alleviation of anxiety which 
for 4 Performance. Ferster et al. (1962), 
"ates Pr reported increases in response 
osaga, 2 pigeons under lower and middle 
Simi i With a decrease at higher dosages. 
(1955, Tesults were reported by Schieckel 
Sump we monkeys, and Kornetsky and 
es "es (1958) found that under low 
Schizophrenics showed much less 
eae than normals in various cognitive 
It wechomotor tasks. 
ap ld appear that, in spite of the sub- 
numbers of studies investigating 
Omazine, there has not been a con- 
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clusive, parametric investigation of the ef- 
fects of this drug on learning, with controls 
for all variables found to be relevant. In the 
absence of such investigations no generaliza- 
tion to global situations is justiñed, but the 
nearly unanimous findings of the limiting 
effect of chlorpromazine on such a wide 
variety of learning measures does seem to 
support a conclusion that, since chlor- 
promazine is consensually considered the drug 
of choice for treatment of schizophrenic dis- 
orders (Freer, 1962), such treatment must 
be willing to sacrifice some degree of learning 
on the part of the patient. 
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VISION IN THE GANZFELD 


LLOYD L. AVANT? 


Kansas State University 


A summary of the evidence on exposure to structureless visual fields is pre- 
sented. The data show the experience of such fields to be characterized by 
reports of: immersion in a “sea of light” which separates into figure and 
ground as brightness is increased, chromatic adaptation in colored fields, loss 
of efficiency in detecting the presence and movement of inhomogeneities intro- 
duced into the field, disorientation of the observer, an increased and fluctuat- 


ing state of accommodation, and the occasional 
cessation of function of the visual mechanis 


in the brain. 


The advent of high-altitude supersonic 
flight and space travel have added to the theo- 
retical concern for the Ganzfeld a practical 
interest in the effects of a structureless field 
upon vision. The loss in efficiency of the 
visual mechanism in the Ganzfeld is similar 
to that of high-altitude flight. This similarity 
is exemplified by the fluctuation and increase 
in accommodation characteristic of “empty 
field” myopia for the Ganzfeld observer and 
“sky” myopia in the case of high altitude 
flying. The purpose of this paper is to review 
and discuss available evidence on the effects 
of exposure to structureless visual fields, 

In 1930 Metzger seated observers 1.25 
meters distant from a carefully whitewashed 
4 x 4 meter square surface from which wings 
extended toward the observer on three sides. 
He illuminated the field with a neutral light 
and asked the observers to verbalize their 
experience of this field. Metzger termed this 
field the Ganzfeld. Gestalt theory, holding 
that primitive vision is binocular rather than 
monocular, would predict for Metzger’s ob- 
servers an experience of tridimensional, rather 
than bidimensional, space, Metzger’s data 
favored this Gestalt prediction, and the con- 
tribution of Metzger’s work to Gestalt theory 
is reflected in Koffka’s (1935) statement of 
Gestalt thinking. 

Metzger’s results did not, however, com- 
pletely support the Gestalt position, and 
Metzger’s query, as well as his new method 
of studying visual perception, went unnoticed 


1 The author wishes to thank Harry Helson and 
William Bevan for their critical reading of this 
paper. 
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joint occurrence of an apparent 
m and increased alpha activity 


for some years. The revival of interest 19 = 
experience of the Ganzfeld has ee. 
largely during the last decade. More = a 
research has shown that, with regar d the 
number of visual functions or Se ady 
Ganzfeld produces an experience mP ote 
different from the experience of strae Eo 
fields. Varying the brightness and/or, o 
maticity of the field or the introduction; 

spots differing in brightness or chromal™ e- 
from the remainder of the field produces 

sponses to such variations which differ ure 
responses to these variables in struct an 
fields. The observer’s accommodation itu- 
orientation are affected in the Ganzfel seats 
ation, and the visual mechanism even PP eld 


at times to cease functioning without 
structure, 


PHENOMENAL CHaracteristics OF a 
GANZFELD tral 
Illuminating the Ganzfeld with 9. 
light, Metzger (1930: cited by Koffka, wi” 
found that the observer felt “himself 5% sre 
ming in a mist of light which becomes 
condensed at an indefinite distance- rer 
tically all subsequent investigators tig 
ported a like experience for the obse!” och 
the Ganzfeld, Impressions reported bY ` sych 
berg, Triebel, and Seaman (1951) were? 
as to be subsumed under the term rade 
“cloud.” According to Gibson and wa of 
(1952) the experience was that of @ ensin 
light,” with some observers at a oat i 
something vaguely surfacelike in perv 
the face. Cohen’s (1956, 1957) a fel 
found it difficult to apply to the emP 
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CET the language usually adequate to 
“Sea visual experience of structured fields; 
of light” seemed most descriptive for 
most of these observers. 
a poa experience includes surfaces and 
concaye p aped three-dimensional surface” 
Tribe] Pe ao the observer (Hochberg, 
effect” op o anan, 1951), a “cracked ice 
1957) mai web-like structure” (Cohen, 
and Pes hd ae of vitreous opacities 
1962). Mni, ood vessels (Miller & Hall, 
gra ations = m Ludvigh (1960) reported 
erent flee ante and shade, rings of dif- 
S stimulati shapes, and when homogene- 
lon of other sense modalities was 
the experience of the Ganzfeld, 
allucinations resulted (Bexton, 
Tha’ È Scott, 1954), 
duced, polation, or distance, of the fog pro- 
termined, the Ganzfeld is not accurately de- 
the ho - Cohen’s (1957) observers described 
although Scneous field as “close at hand,” 
Among ob he meaning of “close” varied 
Consistently we Only one of these observers 
than ¢ |Y described the field as being more 
tas wA away, and the modal distance 
a indef les. However, the fog extended for 
(195) nite distance. Gibson and Waddell 
the St pail out the voluminous nature of 
pete Mg a of the fog. Their report is in 
asis for with other investigators and is the 
“Stecment Gibson and Waddell’s tentative 
OF the with Metzger’s findings in support 
ture estalt belief in the tridimensional na- 
Gibso Primitive visual experience. However, 
: conclusi Waddell stressed caution in such 
perje, ON because observers agree that the 
Mbiey © is indefinite, indeterminate, or 
ty ell ha, Neither Metzger nor Gibson and 
of ke TORS &lven definitive evidence of the 
Primitiy, Onal or three-dimensional nature 
Various e experience. 
Altereffects of exposure to homo- 
Stimulation for a period of 20 min- 


Observers had difficulty maintain- 
Time perception was disturbed. 
complained of dizziness and 
S appeared to be intoxicated. One 
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observer experienced temporary states oi 
depersonalization, a condition which Cohen 


did not define. 


BRIGHTNESS VARIATIONS IN THE GANZFELD 


The “mist of light” experience described 
by Metzger’s observers prevailed as long as 
the illumination of the whitewashed surface 
remained “below a certain level.” Increasing 
the illumination produced new experiences. 
As the illumination was increased 


the fog became condensed into a regularly curved 
surface which surrounded the observer on all sides; 
its appearance was filmy like the sky, not surfacy, 
and, similar to the sky, it was slightly flat in the 
center [Metzger, 1930, p. 13; cited by Koffka, 1935]. 


Further increase in the illumination caused 
the surface to straighten out into a plane, the 
apparent distance of which was definitely 
greater than that of the real one. The expla- 
nation of this result offered by Koffka was 
that the increased illumination allowed the 
observer to accommodate to the microstruc- 
ture or “grain” of the surface, and there was 
no longer homogeneous stimulation. Under 
Metzger’s conditions, these results are not 
surprising. It is of interest to note, however, 
that Metzger’s results tend to support Ittel- 
son and Ames’ (1950) contention that ac- 
commodation is to apparent rather than to 
objective distance. Metzger’s observers, given 
sufficient illumination to accommodate to the 
microstructure of the viewed surface, saw it 
as farther away than was actually the case. 

Ittelson and Ames varied the apparent dis- 

tance of playing cards by changing the size 

of the cards; they found, with a constant 

distance for cards of different sizes, that ac- 

commodation and convergence varied with 


apparent distance. 
Cohen (1956, 
duced the Ganzfeld 


btained different i } 
ee of brightness variations. His Ganzfeld 


was produced by two adjoining spheres, A 
and B, 1 meter in diameter, which, at their 
juncture, produced an 8-centimeter aperture. 
Wearing a special, mask which also formed a 
removable section of the wall of Sphere A, 
these observers looked monocularly „into 
Sphere A. By controlling the chromaticity, 


1957, 1958a, 1958b) pro- 
differently from Metzger 
indications of the 
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purity, and luminance of the two spheres in- 
dependently, the experimenter could produce 
a homogeneous field or a field containing a 
background (Sphere A) and an 8-centimeter 
spot (the juncture of Spheres A and B). In 
the homogeneous field Cohen found that vari- 
ations in the intensity of the field did not 
consistently alter the appearance of the field. 
There was, however, some tendency for the 
fog of brighter fields to appear denser and 
closer. 

With a field made inhomogeneous by the 
introduction of the spot, he found that varia- 
tions in brightness differences between field 
and spot within the range 1.8 to 4.9 milli- 
lamberts produced no statistical evidence of 
a relation between intensity of illumination 
and the density or closeness of the fog. This 
lack of agreement with Metzger’s findings 
Cohen attributes to the microstructure of 
Metzger’s field which was absent in his own. 
With regard to the separation of field and 
spot, Cohen found that when the gradient 
between field and spot is not steep the fog 
seems nearer but the spot seems farther 
away. Further increase in the intensity 
gradient between field and spot reverses this 
appearance; the fog recedes, and the spot is 
seen in front of the field fog. At a maximum 
intensity difference between field and spot, 
judgments of figural distance ranged from 
“next to the eye” to 3 feet away with a 
median of 6 inches. The ground was usually 
judged a few inches behind the spot. Cohen 
reports great individual differences between, 
but consistency within, distance judgments 
of his observers. In addition to these effects 
of independent variation of brightness, this 
investigator found that when inhomogeneity 
was initially produced by chromatic differ- 
ences the figure-ground separation thus es- 
tablished was enhanced by the addition of a 
brightness gradient. Cohen concluded that the 
phenomenal characteristics of the Ganzfeld 
are not independently determined, He main- 
tained that any change in the stimulus dis- 
tribution that modifies one aspect of experi- 
ence produces concomitant changes in other 
phenomenal characteristics. of the Ganzfeld. 
The most recent study of the Ganzfeld 
(Weintravb, 1964) indicates that Cohen’s 
conclusion is incorrect, and that intensity of 
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stimulation and the colorimetric purity of = 
Ganzfeld are independent dimensions. W oe 
traub’s results appear to disagree with a 
well-established principle of color ponyo a 
(Helson, 1938), certainly with regard er 
chromatic effect of brightness variation. © E 
conflict is discussed in the following section- 


ELD 
CHROMATIC EFFECTS IN THE GANZF 


peri- 

Questioning the necessity of the pna 
ence of a mid-gray as the end result of e: 
longed homogeneous stimulation, as requ 
by Hering’s opponent processes ten 
vision, Helson and Judd (1932) const" apet 
an “adaptation sphere” of Hering p its 
suspended a 100-watt Mazda lamp sphere 
center, and had observers sit with the mit 
encircling their heads for as long as 7 matic 
utes. They report that whatever cht Z he 
adaptation took place occurred gunka 
first few minutes, but at no time, eve? ce 0 
75 minutes, did the color of the surta the 
the sphere disappear. The paper of oe ciel 
inner surface of the sphere was const! at 
was an orange-red, and the authors a ae 
the red component tended to be Te wish 
making the orange surface more yello no 
Only during very brief periods last"? sre 
more than a few seconds when the ey par. 
held fixed did a fairly good gray an evi 
These authors interpret their results # jsio® 
dence against the Hering theory of end 
with its concept of a mid-gray as that 
result of adaptation. They conclude peld 
“eye-movements alone, all other factor a of 
constant, are sufficient to arrest the cour with 
adaptation so that complete adaptation 974.” 


ter 


structure. However, Helson ? reported 3 
published confirmation of his and 
findings and pointed out (Helson, 196 
complete adaptation is not found W! 
intense stimulation, a 

The results obtained by Hochber# cbt?” 
bel, and Seaman (1951) concerniné. at? 
matic effects in the Ganzfeld contrast ê yud 
with those obtained by Helson 2” 


2 Personal communication, 1965. 
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VISION IN THE GANZFELD 


(1932). In the first of a series of studies, 
Hochberg, Triebel, and Seaman produced a 
ferveseneous field by taping halved table- 
ennis balls over the eyes of their observers: 
Pacey Mulated one group by a 100-watt light 
tps through a red Wratten filter (no. 70) 
a ee by the same light passed through 
sittin filter (no, 54). In the group 
initial, ed by red light, 5 out of 6 observers 
field in reported a red-colored surfaceless 
color Owed by the disappearance of the 
ing A within the first 3 minutes. The temains 
imen sctver of this group reported a three- 
ark Penal surface which remained a “very 
ieee Townish magenta.” All the observers of 
tiay oe? Stimulated with green light ini- 
Owed parted a green surfaceless field fol- 
total ao ain a maximum of 6 minutes by 
Point 'Sappearance of the green. The one 
Hochber Which the Helson and Judd and the 
is that e Triebel, and Seaman studies agree 
tion js the final state of chromatic adapta- 
ing. Pin the mid-gray postulated by Her- 
tle col ereas Helson and Judd reported lit- 
Man a loss, Hochberg, Triebel, and Sea- 
black Ported final adaptation to range from 
In ie Nothingness”) to dark gray. 

Triebel net Phases of their work, Hochberg, 
While th and Seaman covered the right eye 
= Breen | left was adapted to either a red or 
Coveri light. They stated that when the 
Sbsepy, 8 Was removed from the right eye the 
lumin *t reported the perception of the il- 
Sery Ant color and three fifths of their ob- 
D Ocalized this perception in the non- 
los ed right eye. Again, they reported the 
color perception with the right eye 
adapt, longed stimulation, although this 
tinea Was not as satisfactory as that 
both eya, {th simultaneous stimulation of 
Meng YS or of one eye alone. The move- 
the fos a finger so as to cast a shadow on 
tye “er portion of the field of the adapted 
Shag Produced the experience of a black 
™ with a halo of the color of the illumi- 
Was ie Ye movements, after the loss of color 
ean ee by the observer, produced no 
ance of color for half the observers 
for g Stig flashes of either red or blue-green 
adug Other half. Accordingly, the authors 
“Nite ed that eye movements have only 
ability to restore color perception. 
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Interruption of the red light for approxi- 
mately 2 seconds produced a complementary 
blue-green for 9 out of these 10 observers, and 
reestablishment of the red illuminant pro- 
duced a brief flash of red. 

Hochberg, Triebel, and Seaman's use of 
halved table-tennis balls to produce homo- 
geneous stimulation deserves notice. Other 
investigators have used this technique (e.g., 
Weintraub, 1964). It is highly likely that 
this technique produces a different field from 
that produced by a larger stimulus field at a 
greater distance from the eyes. 

Producing the homogeneous field by a 
technique different from that of either Metz- 
ger, Helson, and Judd, or Hochberg, Triebel, 
and Seaman, Cohen (1956, 1957, 1958b) 
also found a loss of chroma within an adapta- 
tion period of 3 minutes for 80% of his ob- 
servers, the remaining 20% reporting only a 
“trace of color.” As previously noted, Cohen 
produced either a homogeneous or inhomo- 
geneous field by varying the chroma and/or 
luminance of two adjoining spheres. The ob- 
server looked into one of these spheres, and 
both the luminance and chromaticity of each 
sphere were independently controlled. The 
observer’s visual field could be made either 
homogeneous or inhomogeneous by the pres- 
ence or absence of color or brightness gradi- 
ents between the two spheres. 

The reports from Cohen’s (1957, 1958b) 
observers are largely in agreement with the 
results obtained by Hochberg, Triebel, and 
Seaman (1951). The most noticeable dif- 
ference in the results of these studies is that 
Cohen’s results would support the medium 
gray end state of color adaptation called for 
by the Hering theory, whereas Hochberg, 
Treibel, and Seaman reported an end state 
of too dark a gray to be in accord with Her- 
lt sas Cohen suggested that this dif- 
ing’s position. i he fact that 
ference may be attributable to the 
Hochberg, Triebel, and Seaman used a lower 
intensity of illumination and longer periods of 
adaptation. Cohen also reported that domi- 
nant, long wavelength illumination made the 
Ganzfeld fog appear closer and denser than 
did dominant, short wavelength illumination. 
His observers reported feeling less “im- 
mersed” in the “blue” field, and to Cohen this 
suggested that the receptors sensitive to long 
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wavelengths are more densely concentrated 
than those sensitive to short wavelengths. 

Cohen’s introduction .of a differentiated 

area into the field changed its phenomenal 
characteristics. When this differentiation was 
due only to chromatic differences between 
the field and the 8-centimeter spot, the field 
was described as more saturated than was the 
homogeneous field. The addition of an inten- 
sity difference to the chromatic difference 
reduced the saturation of the field to the 
level reported for the homogeneous field. This 
was true except when the difference in in- 
tensity was such as to make the spot achro- 
matic and black; in this case, the saturation 
of the field surrounding the spot was in- 
creased. And, as might be expected by classi- 
cal contrast, most of the saturation enhance- 
ment occurred in the area of the spot figure. 
Cohen (1958b) stated that the presence of 
a chromatic difference within the field does 
not radically change most of the phenomenal 
characteristics of the Ganzfeld, that its prin- 
cipal effect is on the experience of hue. It is 
noteworthy, however, that a chromatic dif- 
ference produces a separation of figure and 
ground, albeit a relatively indefinite one. 
With a chromatic difference, observers de- 
scribed a well-saturated figure in a poorly 
saturated ground, even when the stimulation 
from the spot was achromatic. Again, Cohen 
pointed out that this Separation is primarily 
in one dimension of experience only, that is, 
color. With larger intensity gradients between 
figure and field, Separation of the two be- 
comes possible on other dimensions as well; 
such aspects as spatial separation, differences 
of contour, and textural differences between 
figure and ground become more pronounced, 
and a reduction of saturation in the field 
occurs. As noted above, with moderate in- 
tensity gradients Cohen’s observers reported 
the fog of the field nearer and the differen- 
tiated figure farther away, while an increased 
intensity gradient reversed this relation, 
causing the field fog to recede and the figure 
to approach the observer, 

The most characteristic effect of adapta- 
tion to inhomogeneity in the field Cohen 
(1958b) found to be the formation of a halo, 
usually involving desaturation and darkening 
of the field, immediately following the for- 
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mation of a small, bright, highly saturated 
area encircling the spot. This halo may Fo 
sist of a mixture of colors which a if 
fused, but appear to exist independent va 
different planes. Although spatial nee 
istics of the halo vary, it was most often E 
as being in a plane behind the figure ee 
front of the field, appearing, thus, to i 
tate the separation of figure and er te 

Although it was performed in a a tre 
field, a study reported by Helson and WE 
(1940) provides an interesting gompa i 
with Cohen’s findings in a chromatic ic in- 
Cohen reported the effects of chromat! the 
homogeneity to be restricted largely tO an 
color dimension of experience. Helson theit 
Jeffers found that object colors lose the 
individuality to a great extent when ngl 
illuminant changes from daylight to eg 
chromatic. With such a change pee 
that colors retreat into the plane of the ]mi€", 
ground, that they become softer and E. 
that demarcation between object and 5 
ground colors is not sharp, and conton a 
most disappear. In addition, the obs ich 
seems to be bathed in the illumination Chro- 
takes on a strong affective character. 5 ap 
matic illumination results in perceptio” 
Proaching those of Ganzfeld conditions: . ine 

In two additional studies, cessation © ure 
experience of color after prolonged ¢XP ct of 
to an empty field was found, An abstra ten 
a study unavailable to this writer e 
Doesschate (1961) stated that, whe ae 
empty field is viewed through colored E amit 
the perception of chroma gradually 0 of 
ishes and is often followed by a sensat! jod8 
dark grey... . [and in two cases] P 
of apparent darkness [p. 565].” net 

The most recent study conducted 64) 
empty field conditions (Weintraub, with 
Produced results apparently conflicting com 
the well-established principle of color “ont 
version (Helson, 1938), Helson’s stat ging 
of this principle is that in every Vstab” 
situation an adaptation level (AL) 38 fan 
lished such that objects having reflec e of 
above the AL will be tinged with the 
the illuminant, objects having Te hue 
below the AL will be tinged with the nan! 
the afterimage complement of the ilU™" eat 
and objects having reflectances at 
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the AL will be either achromatic or weakly 
Saturated and of uncertain hue. Weintraub’s 
aces adapted to a Ganzfeld which had 
Etc: a dominant wavelength of 648 mp, 
30 Tete urity of 1.00, and luminance of 
oe bert or a field having the same 
CEREN Wavelength and luminance but a 
Shaun Purity of .50. After 6 minutes 
ene to one of these two fields, his ob- 
648 E were exposed to postadapting fields of 
in edia a mj; 493 mp, or 568 my, varying 
0 aao i purity by steps of .25 between 
ing fro 00, at various luminance levels rang- 
Weint 00 to .60 footlambert. 
red ey Taub stated that with an adapting 
a ies oe purity, 8 out of 10 observers 
ing fr completely, adaptation times rang- 
these a 2 seconds to 12 minutes. Seven of 
Others v, Servers reported a dark field; the 
edad reported brightness but no hue. With a 
ol Me field of 50% purity, 6 out of 10 
ing a adapted chromatically, times rang- 
Servers y 2 minutes to 40 minutes. All ob- 
Purity ae adapted to the red field of 50% 
Purity a'so adapted to the red field of 100% 
lower? Bud adaptation to the stimulus of 
Purity always required more time. 
Ceggi oe experiences initiated by the suc- 
cintr contrast of the postadapting field 
Etric ao reported to be a function of colori- 
change Purity but not of luminance: “A 
stag in luminance from adapting field to 
Ness peting field affected perceived bright- 
a apti Ut not perceived hue [p. 559].” Post- 
to or ng fields of colorimetric purity equal 
fielq va ater than the purity of the adapting 
of co Pa reported as red; postadapting fields 
aq Ptin ‘metric purity less than that of the 
S field were perceived as a comple- 
lue-green hue. 
tra r of previous evidence clearly dem- 
ng the applicability of the principle 


ej nce dimension as a determinant of 
additi hue, Weintraub’s results call for 
Belso 4 evaluation. Note, for example, 
of po (1938) and Judd’s (1960) handling 
i te. Land (1959) phenomenon by color 
‘on t Sion, The present writer is of the opin- 
train àt a closer examination of Weintraub’s 
tatio 8 Procedure may change the interpre- 
" Of his results, Weintraub exposed ob- 
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servers to a red (648 mp) adaptation field 
followed by an achromatic postadaptation 
field and trained these observers to describe 
their experience in terms of hue, saturation. 
and brightness, for example, “hue, red; satu- 
ration, 50% ; brightness, 10.” The investigato 
did this to teach his observers the judging 
procedure and to establish a stable colo 
reference system. The color reference system 
thus established suggests itself as at least 
a partial explanation of the divergence of 
Weintraub’s results from previous ones. After 
exposure to one of two red adaptation fields. 
Weintraub introduced an achromatic field 
and trained his observers to call the color 
thus produced by successive contrast “blue- 
green,” an afterimage complement of the red 
adapting field. Helson (see Footnote 2) sug- 
gested as one possible reason for the conflict 
between his and Weintraub’s results the fact 
that the differences in luminance between the 
latter’s adaptation and postadaptation fields 
were not great enough to yield afterimage 
colors. For Weintraub’s observers the change 
was from .30 footlambert to .20 footlambert, 
whereas the nonselective patches of Helson’s 
1938 study had reflectance values ranging 
from .80 to .03. Thus, for Helson’s observers 
the light reaching the eye varied by a ratio 
of approximately 27-1, whereas the ratio of 
change for Weintraub’s observers was 1.5-1. 
Also, Weintraub’s results might have been 
different had the shift from adaptation to 
postadaptation fields been gradual rather than 
abrupt. The effect of rate of change in illumi- 
nation is best noted in the dimming phenome- 
non in which the complementary color re- 
sponse to a chromatic spot on a nonselective 
background is facilitated by slow, rather than 
rapid, dimming of the illumination of the 
spot. Illuminating a nonselective ground with 
chromatic illuminants, Higbee (1947) found 
that apertures centrally located in the back- 
cround showed conversion toward the after- 
image complementary of the surround as the 
ratio of aperture luminance to surround 
luminance decreased. These results, coupled 
with those of Helson and Self (1961; cited 
by Helson, 1964) showing the complemen- 
tarity of colored shadows to be a function of 
the ratio of the luminances of the two light 
sources, support the principle of color con- 
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version and present an explanatory challenge 
which Weintraub’s work fails to meet. At 
present there remains the challenging research 
opportunity of demonstrating that Wein- 
traub’s results, rather than denying the op- 
eration of the principle of color conversion, 
indicate its operation when the purity of the 
Ganzfeld is varied. 


ORIENTATION IN THE GANZFELD 


Upon exposure to the Ganzfeld, observers 
search hurriedly for something on which to 
focus the eyes in an effort to orient them- 
selves to the visual field (Miller & Hall, 
1962). These investigators found that ob- 
servers, finding no fixation objects, become 
aware of vitreous opacities and retinal blood 
vessels, report gradations of light and shade 
and, in extreme cases, hallucinations. In ad- 
dition to these phenomena, the observer soon 
becomes conscious of an uncertainty as to 
where he is looking. This uncertainty be- 
comes a serious problem if the observer’s task 
is to locate a single object. The observer not 
only does not know where he is looking, he 
also does not know where he has been looking 
previously. Miller and Ludvigh (1960) have 
shown that it may take as long as 20 seconds 
to locate an object as large as 6 times the 
threshold size when no external cues are avail- 
able. Ludvigh (1936; cited by Miller & Hall, 
1962) obtained evidence that with a small 
section of the cornea anesthetized the eye can 
be moved mechanically without arousing any 
sensation of eye movement. He also found 
(1952; cited by Miller & Hall, 1962) that 
using false cues can cause an observer to 
feel that his eyes are in violent motion when 
they are in fact stationary, From such evi- 
dence Ludvigh attributes disorientation in 
the Ganzfeld to the lack of proprioceptive 
feedback from the extraocular muscles. 

Miller and Hall prefer to interpret orienta- 
tion in the Ganzfeld as due to the fact that 
the individual orients himself in relation to 
some sort of internal bodily coordinate sys- 
tem. They define a displacement threshold 
as the ability of the individual to place a 
single test object in the center of the visual 
field in the absence of all visible detail other 

than facial features. These investigators pro- 
duced homogeneous stimulation by means of 
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a clear Plexiglas cylindrical annulus contain- 
ing a fogging solution through which ise 
observer viewed a uniformly illuminated 
white field. A test object subtending a visu 
angle of 25 minutes at the nodal point of the 
eye was moved in a horizontal arc approx 
mately 11 feet from the observer. This 
object was randomly positioned across 4 : g 
degree horizontal path, and observers XE 
instructed to return it to the center of to 
field. Subjective center was found to be s 
the left of true center for half the observe 
and to the right for the other half, ean 
displacements being 2.37 and 2.94 dean 
respectively. Seventy-five percent of the a he 
these observers moved the test object 1? a 
appropriate central direction when it oo 
displaced as little as 2.7 degrees from ee 
center. Miller and Hall also provided "Geld 
ence positions at various points in the The 
to which observers returned test objects- ore 
appropriate reference was presented 

each displacement of the test object, ® 
under these conditions the displace™¢ 0 
thresholds for different positions average’ at 
degrees. These investigators conclude el 
although the eye may lack an exten die 
accurate proprioceptive mechanism, the ’ ac 
vidual appears to possess surprisingly of 
curate spatial orientation in the absent’ ds 
retinal cues. Subjective reports of me 
used by their observers in locating the © 
of the field indicate that they resorte the 
orientation cues from various parts 3 unk, 


body, for example, neck, shoulders, 
and hips. 


LD 
Form PERCEPTION IN THE Ganzee® 


e 
, The most primitive level of form per? by 
tion in the Ganzfeld is that report ue 
Cohen (1957) who found that the intr? 


. . ant 
tion of a brightness gradient into the nd 
feld at first reversed the usual figure-8"° ipe 


the usual relation of figure to ground. 
also found the spot was judged ab" 
times as large as its real size. y 
Prolonged homogeneous stimulation 
to result in the failure of the percept’ agh?” 
anism to produce a phenomenal field. 


m 
se’, 
eo 


Ee 
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tg terms this phenomenon “blank out” 
lone it is best described as a temporary 
told 10n of vision. This investigator tested 
iir a l ability to discriminate forms dur- 
ing T out by tachistoscopically present- 
Plexit: my second) figures varying in com- 
Dresentatio as a circle and a star. Time of 
| cars of the forms coincided with the 
lank Pat mdication that he was experiencing 
Plex figure With these observers the less com- 
nthe ires were usually correctly identified 
case s of no blank out, but in no 
f the f the observer able to recognize any 
ability ps during blank out. This lack of 
even P recognize the objects persisted 
gure ae Tepeated presentations of the same 
Ould Pe blank out. At best, observers 
Presence at ve only a “flash.” However, the 
Seem to z the forms did, in many instances, 
Xtreme erminate the blank out. In the most 
entatig, cases of blank out such a brief pres- 
t n of the figures did not produce even 
“Port of a flash Presentation of the 
Bures fop j i 

termi as long as 2 seconds did, however, 


In; 
ut, ate even the extreme cases of blank 


of pending his investigation of the effects 
Presenter ous stimulation, Cohen (1960) 
'Mmediat r series of 16 figures to observers 
neous «2, UPon their entry into the homo- 
the poo field, after a 90-second adaptation to 
hom omogeneous field, or in a field with a 
Periph cous center portion and a structured 
Xposuy, Y- He found that even a 90-second 
With po tO a homogeneous field interferes 
Mater Pa recognition, The extent of this 
entation SS iS somewhat ameliorated by pres- 
"eCogni Of the series of figures; accuracy of 
ably Pa for the first 2 figures is consider- 
facili Wer than for the remaining 14. This 
Mainta; E effect of series presentation is 
a ned even if exposure of the figures is 
t as 1/100 of a second, unless the 
etween presentations is extended 
*Bhizeg to 30 seconds, The most easily rec- 
Nong figures were the circle, square, dia- 
Most ANd equilateral and right triangles; 
Penta "cult to recognize were the star, 
Pte y hexagon, octagon, cross, and the 
Sh - Rectangles, parallelograms, the let- 
Wor, ? the letter L, and the inverted letter T 
Moderate difficulty. Cohen noted that 
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the detrimental effects of the uniform field 
are most pronounced in the recognition of 
the more difficult figures; although prolonged 
exposure to the uniform field reduced the 
accuracy of recognition for all figures, the 
greatest loss in accuracy occurred with the 
more complex figures. 

Miller and Ludvigh (1960) investigated 
the effects of homogeneous stimulation on the 
acquisition of both stationary and moving 
test objects. Their Ganzfeld was produced 
by the cylindrical annulus described previ- 
ously, and their 5 test objects subtended 
visual angles of 10.00, 12.34, 17.45, 29.55, 
and 59.13 minutes of arc at the nodal point 
of the eye. Stationary targets could be posi- 
tioned at any one of 29 positions within the 
field. Moving targets were moved horizontally 
from one or another of these 29 positions at 
a rate of .14 degrees per second, a rate pre- 
viously determined to be below threshold 
for moving objects. They found that the 
same general relationship held for all their 
experimental conditions; length of time re- 
quired to locate a test object was inversely 
related to the size of the test object. Acquisi- 
tion time increased over a range of approxi- 
mately 3 to 120 seconds, with the increase 
becoming quite rapid when test-object size 
was less than approximately 20 minutes of 
arc, This result was unanticipated in view of 
the fact that visual acuity previously meas- 
ured through the fogging solution of the 
Ganzfeld was found to average 3 minutes of 
arc for the 3 observers in this study. These 
authors pointed out the observer’s lack of 
awareness of where he is presently searching 
or where he has previously searched in the 
Ganzfeld and noted that this introduces an 
element of chance into visual search in the 
Ganzfeld, Calculating probability of acqui- 
sition as a function of target size, these 
authors indicated that the probability of 
target acquisition does not reach a range of 
approximately .82 to .92 until target size has 
increased to approximately 60° minutes of 
arc. Using as a base the previously deter- 
mined average visual acuity of 3 degrees, 
Miller and Ludvigh found it necessary to 
increase the angular subtense of the target 
by a factor of 10 over the measured visual 
acuity in order for acquisition to occur 
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within a relatively short time (approximately 
3-5 seconds). 


MOVEMENT PERCEPTION IN THE GANZFELD 


Cohen (1958a) reasoned that if autoki- 
netic movement under conditions of darkness 
results from an inadequate visual framework, 
then similar movement of a figure in an 
illuminated uniform visual field should occur. 
Using the adjoining spheres described pre- 
viously, he exposed observers monocularly to 
a field containing an 8-centimeter-diameter 
spot. His subjects observed a series of situa- 
tions in which differences between the figure 
and field were systematically varied and, in 
about 40% of these situations, reported spon- 
taneous autokinetic movement of the spot. 
The predominance of movement to the right 
is probably attributable to the fact that 
Cohen’s observers viewed the field monocu- 
larly with the right eye. The frequency of 
reports of movement was not wholly depend- 
ent upon the kind of difference between fig- 
ure and ground, that is, either chromatic 
differences or intensity differences produced 
movement. When intensity was gradually 
increased the previously noted phenomenon 
of receding ground and approaching figure 
occurred. 

In a subsequent publication, Cohen (1960) 
reported blurring of contours, transforma- 
tions of shape, and disappearance of station- 
ary figures in the Ganzfeld. Observers were 
also found to be relatively insensitive to the 
movement of objects slowly rotated in the 
uniform field. When objects were moved at 
near threshold speeds, movement was often 
perceived in the direction opposite to the 
real movement, an extremely confusing ef- 
fect for the observer when he found that the 
phenomenal displacement of the object was 
in a direction opposite to that of his experi- 
ence of it. 

Miller and Ludvigh (1961) noted Aubert’s 
1886 estimate that the absolute threshold 
for movement increased by about a factor of 
10 when the subject was without a satisfac- 
tory frame of reference and Brown’s experi- 
mental demonstration (1931; cited by Miller 
& Ludvigh, 1961) that as the background 
became less and less structured, the phenom- 

enal velocity of a test object decreased, Mil- 
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ler and Ludvigh also reported an experiment 
in which observers were placed in a can 
with eyes closed, given a signal to ope? " 
eyes, and asked to indicate the Ga 
movement of test objects within the ee 
Two targets, subtending 19.1 and 51.7 m e 
utes of arc at the nodal point of the vias 
were exposed for either 5, 10, or 15 iat at 
angular velocities of .30, .56, -71, 
and 2.96 degrees per second. “Time dero 
the elapsed time between actual aye it 
the target and the observer’s report t 
had stopped, was the measure of 
ance. Time delay approached about 7 
ond for all three durations of movemen con 
all higher angular velocities. This cage 
value is probably a limiting reaction Hg 
since less than .5% of the respons ulus 
curred with shorter time delays. AS pa 
velocity decreased time delay increas 
a maximum of approximately ó- 
for 5-second exposures, 10.0 second P 
second exposures, and 15.5 seconds, 
second exposures. No statistical ® a i 
was found between delay times a5 4 ei 5 
of direction of movement. Reported 1” 
of target size rather than exposure an ced 
data showed that the larger target gee at 
shorter time delays than the smaller : 
angular velocities below 1.50 degree” 
second. These authors discussed theit " 
in terms of a hypothesis of “cont 
velocity” whereby the sudden cessati pvi 
high apparent speed is held to be MO , ap- 
ous than the sudden cessation of 10W 
parent speeds. They found, by 2 chive iped 
test, that the following formula 46S% pme 
their data: ¢= (r + 1/av), where * 
delay in seconds, 7 is reaction time i 
onds, v is the velocity of the test ob) nod”! 
angular degrees per second around t er ° 
point of the eye, and a is the parame wp 
inverse proportionality, The adequacy mathi 
formula in describing their data len¢® avis? 
matical support to Miller and M" 
“contrast of velocity” hypothesis. 


ELP 
ACCOMMODATION IN THE GANZ”. 


on! 
An empty visual field produces 2 oa y 
of increased refractive state of the ight it 
lar to the phenomenon known a5 * °? 


iio? 
jo” 
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Opla. The presence of an empty field also 
s fluctuations in the refractive state 
ers T Westheimer and his fellow work- 
ei (Campbell & Westheimer, 1959; West- 
caer, 1957, 1960; Westheimer, Campbell, 
ae 1958) have reported a series of 
tion ae fluctuating levels of accommoda- 
Sine meg by empty visual fields. West- 
Second as 1960) flashed a beam of .05- 
Pim uration into the subject’s eye, the 
the ot i through two fixed positions in 
Period ie pupil every 10 seconds over a 
Cris 30 to 40 minutes and giving in- 
Ee on concerning the refractive state of 
empty ane subject's field was either a dark 
Xation eld, a bright empty field, or a red 
as co at optical infinity. The observer 
Eoas to indicate, by pressing the ap- 
hich i button or combination of buttons, 
e Bies four response categories applied to 
isali tation of two vertically aligned or 
nimaa measuring light bars: over ac- 
ma under accommodation, correct 
lus e Odation, or unable to say. These stim- 
accom, Snditions brought into play positive 
1, ao ation usually of the order of .75— 
eo Ti The accommodative level did 
tun, ap 0 fixed throughout an experimental 
but 7, St runs showed no long-term trends, 
Las sy Were prominent cycle lengths of 
Dlituge 2 minutes, and peak to trough am- 
Harm S sometimes ranged up to 1 diopter. 
tion pC analysis showed the accommoda- 
Som me curves had strong components of 
Characye tency bands, but no frequencies 
conditi eristic of an observer or a stimulus 
(960) From these results, Westheimer 
State of concluded that since the refractive 
2 Struct, the eye is unsteady in the presence of 
h Cor Ureless field, it does not seem possible 
a e rect this myopia with a single lens. 
have noted that high-altitude flight 
piaj © empty field conditions (“sky” my- 
tive st nd, consequently, an unstable refrac- 
(19g ate, For such conditions Westheimer 
Serye, _DOted the probability that an ob- 
“orrec| Even when wearing the best “average” 
May tion for the task of target detection, 
ve Ty, Out of focus by perhaps .5 diopter 
Poon” Brown (1957), working from the 
Mody esis that optical devices that would 
© focus at optical infinity could per- 
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haps correct the unstable refractive state, 
employed collimated reticles and obtained 
negative results. He suggested selection of 
normally hyperopic persons for high altitude 
target-detection tasks. 

According to Koomen, Scolnik, and Tousey 
(1951), the phenomenon of night myopia 
was noticed as early as 1883 by Lord Ray- 
leigh. This is nearsightedness developed by 
the eye in dim illumination, necessitating 
negative lenses to counteract the decremeni 
in visual acuity. Chin and Horn (1956) men- 
tion several bases for this phenomenon: ac- 
commodation, increased positive spherical 
aberration due to pupil dilation in dim illumi- 
nation, the combination of chromatic aber- 
ration and the Purkinje shift, and the fact 
that a point is not imaged on the retina as a 
point but forms a caustic along the visual 
axis near the retina due to spherical aberra- 
tion. 

The magnitude of night myopia is a matter 
of some dispute; values vary from nearly 
zero to more than 2 diopters. For example, 
Otero and Duran (1942), according to Koo- 
men, Scolnik, and Tousey (1951), reported 
values of 1.5 to 2.0 diopters, while Wald and 
Griffin (1947) found much more variation 
from subject to subject. These investigators 
found for their subjects changes in the power 
of the eye ranging from + 1.4 to — 3.4 di- 
opters with an average of .59 diopter. Both 
Koomen, Scolnik, and Tousey (1951) and 
Helson (1952) drew attention to the fact 
that such conflicting results are more than 
likely because various experiments have been 
carried out at widely differing levels of illu- 
mination using a variety of measures. 

Paralleling the lack of agreement as to the 
magnitude of night myopia is the disagree- 
ment as to its cause or causes. This disagree- 
ment has been largely between Koomen, 
Scolnik, and Tousey on the one hand, and 
Otero on the other. The former based their 
conclusion largely on the fact that night 
myopia persisted after accommodation was 
prevented by optical methods and by the 
use of homatropine, and that it could be 
produced by means of a lens system having 
the spherical aberration properties of the eye. 
Otero (1951) said of his work that “. , 
experimental evidence is presented proving 
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that the main cause of night myopia is closely 
related to the mechanism of accommodation 
[p. 942].” This experimenter, using a light 
flash too brief to change the accommodative 
state of a completely dark-adapted eye, pho- 
tographed Purkinje images to obtain a meas- 
ure of the state of accommodation of the eye 
corresponding to its state of rest. For 3 ob- 
servers this state proved to be 1.12, 1.22, and 
1.24 diopters. Otero maintained that spheri- 
cal aberration and chromatic aberration con- 
tribute .3 and .4 diopter, respectively, to the 
1.5 to 2.0 diopters of night myopia he ob- 
served. Adding the contributions of these 3 
factors to night myopia gives totals approxi- 
mately equal to the observed night myopia, 
and Otero concluded that accommodation is 
its main cause. More recently Chin and Horn 
(1956) measured the refractive state of the 
eye in dim illumination and in darkness us- 
ing an infrared skiascope, and they concluded 
that the contribution of accommodation to 
night myopia was generally small. 

Helson (1952) pointed out that the Koo- 
men, Scolnik, and Tousey conclusion, em- 
phasizing the role of spherical aberration, is 
difficult to accept in view of the fact that 
these authors failed in at least 3 instances to 
produce spherical aberration in the eyes of 
subjects. The conflicting results from Otero 
and Chin and Horn likewise leave uncertain 
the role of accommodation. Helson’s conclu- 
sion in 1952 that the problem was not closed 
is equally applicable at this time, 


GANZFELD “BLANK Our” AND ITS ELECTRO- 
PHYSIOLOGICAL CORRELATES 


In a series of 3 studies of vision in the 
Ganzfeld, Cohen (1956, 1957, 1960) reported 
for some observers a temporary cessation of 
visual experience; this phenomenon he termed 
“blank out.” It was found in 5 out of 16 
observers, 2 of whom reported it almost every 
time they were exposed to the uniform field. 
The experience was not simply the presence 
of a dark, undifferentiated field, but rather a 
“complete disappearance of the sense of yi- 
sion for short periods of time.” Cohen (1960) 
found various degrees of blanking out. For 
example, the observer may experience a feel- 
ing of light with no external reference. Or, in 

the most extreme cases, he may experience a 
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“complete absence of seeing,” this latter & 
treme generally occurring only after era 
sures of 10-20 minutes. Observers epu 
uncertainty as to whether or not their i 
were open; they were also unable to con : 
their eye movements. Although blinking r e 
eye movement usually brought about ur 
return of the visual field, this did nok y 
with the most extensive losses of vision. tent 
Cohen (1960) suggested that pers 
uniform stimulation without a ame 
field results in the failure of the percept’ 
mechanism to produce a phenomenal fe orts 
possibility supported by the fact that aw 
of blank out were found to be associate e 
the return of bursts of alpha activity. EEG 
pointed out that alpha patterns in a ab- 
record are usually associated with he 0 
sence of visual stimulation. The ames) 
alpha activity during blank out, he suse" g 
would indicate a functional similarity orm 
tween no stimulation and prolonged e sceP” 
stimulation, Individual differences in SY cot- 
tibility to blank out Cohen found to Dê igh 
related with individual EEG records; im 
alpha activity in the absence of visua ibil- 
ulation was associated with more suse det 
ity to blank out, Cohen and Cadwa col 
(1958) found supporting data for pis 
clusion, a 
b- 


periods. Electrical activity was i 
from six standard EEG electrodes. 
were initially instructed to describe ” tpe 
they saw as accurately as possible. me 
subject verbalized experiencing blank ° bic? 
was instructed to close a microswit¢ n al 
produced a signal on the EEG record e OF” 
experimental sessions, alpha scores va se, 
tained at the beginning and end of op” 
sion under “eyes closed” and “eye? yg e% 
conditions. Tepas found that all subje" yes 
Perienced blank out and that the alp rme” y 
ent during Ganzfeld viewing was mte op®, 

ate between “eyes closed” and “eye af 
conditions for all observers. There W4° ent a 
nificant absence of saccadic eye moven an 
blinking just prior to onset of blank ° 


VISION IN THE GANZFELD 


= dy experience, but the end of the ex- 
Blinkin P with eye movements or 
= £ to a statistically significant degree. 
trian in Increase in the number of viewing 
it aa easg the amount of alpha present 
oùt al not alter the blank-out/non-blank- 
alpha difference. 

inion om a typical high alpha subject in- 
of ongoin at, Irrespective of the general level 
was gi a alpha activity in the brain, there 
ou fine risa more alpha during blank 
“eee Ww eS blank out was not reported. 
alps s also a general increase in the level 
Drogressed a as the experimental session 
i Nowever, the amount of blank out 
ad become relatively stable at 10 
and diq Py the middle experimental session 
of alpha ot increase thereafter with the level 
nt Pree This finding indicates that 
unctio ank-out time need not be merely 
tivit, n of the general level of alpha ac- 
tionshi epas’ data showed a negative rela- 
between amount of eye-movement 


Dotenti 
Causing S end amount of blank out reported, 
that po am to conclude that the hypothesis 


Cant igh alpha activity is an important indi- 
Str ank-out susceptibility was not 
Ongly supported. P y 


Or z 
Contribs e tiled specification of the factors 
bank "8 to the relationship between 


Conti at and alpha activity deserves the 
Bigg Ed interest of physiological psycholo- 
Ki do the relationships between the 
Wo i Contributing to night myopia. These 
demons lems, along with the possibility of 
of co trating the operation of the principle 
S va ice conversion when colorimetric purity 

, are likely to be rewarding ones for 

researchers, 
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UP COHESIVENESS AS INTERPERSONAL ATTRACTION 


A REVIEW OF RELATIONSHIPS WITH ANTECEDENT AND 
CONSEQUENT VARIABLES? 
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The cohesiveness of small groups is d 
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AND 


efined in terms of intermember attraction 


and i 
eA do see for such an approach is discussed. The empirical literature, 
eo pi imarily to investigations published between 1950 and 1962 fe 
‘ave i p the aim of evaluating the status of variables hypothesized as 
To this P or consequent relationships with interpersonal attraction 
‘lie ant * studies from diverse fields, for example, group dynamics, person- 
cones learning, are brought together and categorized. Theoretical positions 
which fe the development of liking between persons and the effects 
kai yia R as upon subsequent behavior are also examined by specifying 
krh s from systematic formulations and comparing them with the 
ch data. The major intent of this paper is to document relationships 


which have been clearly established 
equivocal or unexplored. 


The a 
and Ors are concerned here with research 
he to the concept of co- 
aNd categori heir starting point in reviewing 
ton o ah zing the literature is the defini- 
Which  ; esiveness as “that group property 
ength of mene from the number and 
Wes Mie positive attitudes among 
hile jg tS Of a group [Lott, 1961].” 
nay be said that this definition 


esiven 


Mer 

e 

0 focy 

3 Ç esivenes on one of several components 

Doe, that aie there is good reason to as- 
Sit erpersonal attraction, liking, oF 


lye s 
Ce at 2 
ae Ù io eee among group members, 1s 
inger other cohesiveness of small groups, 
TAR ently factors are also viewed as 
Tost re y relevant or not. 
to archers in the area have sub- 
t aie position introduced by 
esive achter, and Back (1950) that 
e of a group, that is, the 
i a vigias to maintain their mem- 
in oe; is contributed to by 4 
ions h lependent forces, but most in- 
attr ave focused on one force, inter- 
y 6 ae When investigators have 
og, lis anipulate the cohesiveness of 
Rg Derap aDer wā 
a ly S prepared under a grant from the 
u 
> a 
t News Welfare. We are grateful to Theo- 
Version omb for his critical reading of an 
of the manuscript. 


and to identify those which are still 


groups, regardless of their nominal definition 
of the concept, the operations performed 
have typically involved telling the members 
of some groups that they would probably 
like each other, be congenial, etc., while tell- 
ing others just the opposite. Further, in 
measuring the cohesiveness of experimental 
or naturally existing groups, some socio- 
metric device is generally utilized to deter- 
much each member likes or is at- 
e other members. For the most 
r forces which are assumed to 
be reflected by, the cohesive- 
p have received less attention, 
investigators have manipu- 
ther than interpersonal at- 
g group cohesiveness, and 
d it along different dimen- 
veness in terms of 
herefore, only rela- 


mine how 
tracted to th 
part the othe 
determine, or 
ness of a grou 
although some 
lated variables 0 
traction in varyin; 
some have measure 
sions. Definining cohesi 
interpersonal liking is, t 
tively arbitrary. 

Tagiuri (1958) has noted that 


Factor analytic studies of mutual ratings by mem- 
r on the presence of three 


J] groups concu 


bers of sma the pi 
basic factors: influence and initiative, task com- 
petence, and like-dislike. While the first two are not 

very group, the last one is always 


applicable to e 5 ; 
A _,. [and] other things being equal... 
the category of like and dislike “packages” most of 
the determinants of interaction [p. 317]. 


In a more general vein, we may cite Bonner 


(1959): 
259 
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If we analyze group cohesiveness . . . in terms of a 
group’s attractiveness for its members, we are con- 
fronted by the obvious fact that without at least 
a minimal attraction of members to each other a 
group cannot exist at all [p. 66]. 


In the present discussion, we shall be con- 
cerned only with studies of interpersonal at- 
traction among group members and the rela- 
tion of interpersonal attraction to other 
group process variables. How do persons in 
coacting or interacting groups get to like one 
another? And, once they do to some measur- 
able degree, what are the consequences of 
this state of affairs for the group or for the 
behavior of the members? We want to take 
stock of the answers which existing research 
has already given to these questions and then 
compare the empirical findings with theo- 
retical statements, that is, with predictions 
which have been, or can be, made from some 
of the more systematic formulations of group 
behavior. 

Considered in this review are investiga- 
tions of real or simulated interaction among 
persons who are in association with one 
another on a relatively voluntary basis. 
Omitted are studies of special populations 
like prisoners or psychotics. Another limita- 
tion is provided by date of publication; our 
interest is in the recent research literature 
and the majority of the investigations con- 
sidered here have been reported between 
1950 and 1962. Other reviews have provided 
adequate coverage of the earlier literature 
(e.g, Cartwright & Zander, 1953, 1960: 
Kelley & Thibaut, 1954; Lindzey & Borgatta, 
1954; Riecken & Homans, 1954; Rosebor- 

ough, 1953; van Bergen & Koekebakker 
1959). We have also, for the most part, 
omitted unpublished studies and technical 
reports. Inevitably we will have bypassed 
investigations which some consider of utmost 
relevance despite our attempt at thorough- 
ness within the limitations mentioned above. 

The number of relevant and near-relevant 
papers is staggering, but while a small num- 
ber of phenomena or relationships have been 
demonstrated over and over again, other rela- 
tionships remain as hints, unexplored, or 
equivocal. We can hopefully document both 
the former and latter, helping researchers to 
move away from the known in the direction 
of either more refined investigations of clear 
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i , ' which 
relationships or concentration on those whic 


have not been certainly established. We y 
not deal with problems of methodology n 
measurement, but will focus on what hasi 
shown to hold over a variety of situa cae 
using various techniques to manipulate infer 
ditions and innumerable devices tO 

changes in dependent variables. 


ANTECEDENTS OF LIKING: EMPIRICAL 

RELATIONSHIPS : 
pe dis- 
h have 
quel 


attraction of one individual, A, to 2" 
individual, B. Within each category investi 
found a number of different types °*, ed 
gation which can be roughly class! 
follows: ‘abl 
n of iat 
atl 


1. Experimental manipulatio eq 
X (or Variables X, Y, etc.) with pa 
direct measurement of the depende” 
able, liking, or some other dependen e 
which is assumed to reflect interp“ 
liking, for example, attraction to Se eei 
In some cases this assumption will = other? 
made by the original investigator; | 
by the present writers, dit 

2. Natural groups which are know? "ist 


assume that these other variables #” f 
interpersonal feelings within the tion ° 
studied while in other cases the que ctor 
“which came first” cannot be satis! 
answered. a& 

igin 
We are primarily concerned with es 


measured by direct choice of A for W 4 
measured by A’s rating of B on a scale: of | 


. . i e 
the criterion for choice or rating n of 
general social nature as opposed to ° ud 
Specific work or task-oriented nature: 


; e 
concerned with the latter have heeh jeva 
ered if they were viewed as having ' 
to more general attraction. 


Interaction-Propinquity port 
A sizable number of studies have 5“ # 


i3 One sage 
the general hypothesis that interper™ act!” 
traction is a positive function of 1 


| 


GROUP COHESIVENESS 


Bovard z 

that A ak ce, DIDON) Das Ehon 
members and i nek toward individual group 
will increase w the group as a whole 
Eara dine aty iore de group: 
encouraged a where verbal interaction is 
i de a classes, 
Interaction a relationship between arare 
ratings of vithin groups and average affect 
mately Rtg ty members is approxi- 
at wine McKeachie (1954), too, found 
ered reliabl in group-centered classes dif- 
Centered c] bly from students in leader- 
liking for Se in the direction of greater 
ing school heir group. Propinquity in ‘board- 
reliably li lasas was reported to be very 
sonnet yg Ked with liking choices by Mais- 
imilarly “lman, and Fourment (1952) 
Were fa h college classes, seat neighbors 
another sig to become acquainted with one 
neighbors Tan more often than non- 
„tensity a yrne & Buehler, 1955), and the 
Whether est relationship formed (i., 
erie riend, friend, or acquaintance) 
mer than i to be greater among the 

z tudies of e latter (Byrne, 1961a). 
r tings, outside o natural or semi-natural 
ae also kN of the college classroom, 
aet onship = ed evidence in support of the 
ke anil Po opportunity for inter- 
ie and oi Gullahorn (1952) ob- 
dist of a Jar rviewed clerical workers in an 
ance was As corporation. He found that 
two rmining sean single most important factor 
tite employees of interaction between any 
toa r ‘ghee, and that there was a tend- 
hoy evelop tsent who interacted frequently 
sit sing Bijan aie of friendship.” In a 
frie udents, th or married veteran univer- 
bet ship we re P two major factors affecting 
the cen houses ound to be sheer distance 
the Ouses fag he the direction in which 
thos. study e (Festinger, 1953). In an- 
tesia eik ee an girls living together 
hogetts) donee s among themselves (house 
the sororit, 1 cantly more often than they 
1955 Ouse” 3 rg who lived outside of 
ew “The sod own (Willerman & Swanson, 
Dreqi member ometric preferences of bomber 
tion table fen were shown to be reliably 
ak an finie a knowledge of their loca- 
po Wledge ion on the plane, that is, from 
ttunitie of who they had the greatest 
s to interact with (Kipnis, 1957). 
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In a summer ca : 
Ied bors camp, Sherif and Sherif (1953) 
budding friendships as = by breaking up 
new groups worked pe ere pr thie 
friendship choices were a ayed together, 
away from the boys ie or shifted 
boys within the new grou rerliep towar 
study (Sherif, ‘ae a subsequent 
duplicated these results, Simil eg 1955) 
Heber (1957) have reported a y, eberan 
school children increase th at elementary 
ratings of each other on a aa eles 
scale following common small nah distance 
ence, and Zander and Havelin F960 ce 
that among three-man grou o found 
strangers in the Sesnweaian anal of initial 
ferred to remain in their own a men. pre: 
ing a period of interaction. Both oe meee 
mentioned investigations also der mE 
that the interaction-attraction Pcie a 
subject to variation in degree as s Di To 
of conditions such as success or bef 
task competence of group members lia 
_Even in a competitive situation indi 
viduals, following interaction, aie “in 
found to become more favorable in their 
ratings of the traits of both their own tea 
mates and their opponents (Wilson & Stes, 
1961), although the increase in norbe 
for teammates and opponents is not the DEA 
under all conditions. Finally, the reduction of 
prejudice by white persons toward Negroes 
has been reported to covary with increased 
contact between persons of different color in 
situations as diverse as a meat packing plant 
(Palmore, 1955), a housing project (Deutsch 
& Collins, 1958), and a university classroom 
(J. H. Mann, 1959). 
That sheer contact may be a necessary 
but not sufficient condition for attraction has 
been shown by other investigators. Festinger 
(1953), for example, has discussed a housing 
project in which few group memberships and 
little social life existed among residents. Here 
residents, who felt “forced” to live in the 
project by circumstances beyond their con- 


trol (housing shortage), were largely nega- 
their attitude toward their com- 
hbors. Gundlach (1956) 
sample of white women 


workers who were members of left-wing 
unions with strong fair employment prac- 
tices, persons in one subgroup with high 


tive in 
munity and neig] 
found, among 4 
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education and high social aims who were in 
contact with Negroes of similar education 
and aims to be most hostile and derogatory 
toward Negroes. This subgroup was also 
most antiunion. The other white union mem- 
bers, on the other hand, exhibited far less 
prejudice than that reported for samples of 
the general population. 

In two controlled studies, Stotland, Cot- 
trell, and Laing (1960) and Stotland and 
Cottrell (1962) found no significant rela- 
tionship between interaction and desire to get 
to know one’s fellow group members. The 
interaction in these investigations conveyed 
no information about members’ character- 
istics, was nonverbal, and only faces or heads 
of members were visible. This was not the 
case, however, in another investigation (of 
sorority girls residing in the same house) 
where only a slight correlation was obtained 
between ranks on a liking scale and frequency 
of interaction (Backman & Secord, 1962). 

It seemes likely, from the evidence avail- 
able, that interaction, to produce attraction, 
should take place in a relatively neutral 
atmosphere in which there are opportunities 
for verbal communication or the observation 
of one another’s behavior, 


Special Characteristics of the Group Situa- 
tion or “Atmosphere” 


Cooperation. Stendler, Damrin, and Haines 
(1951) observed groups of second-grade chil- 
dren working for individual reward and for 
group reward. Only under the latter condi- 
tion did positive interactions exceed negative 
ones. Groups of fourth-grade children were 
studied by Phillips and D’Amico (1956). In 
one condition rewards were distributed ac- 
cording to individual contributions while in 
another rewards were shared equally. Attrac- 
tion to members of one’s group, measured 
sociometrically, increased in more groups of 
the latter kind than in those of the former. 
Further, in those competitive groups where 
rewards were distributed fairly evenly þe- 
cause members were well matched, the level 
of intermember attraction either remained as 
it was or increased. Increased liking, accept- 
ance, or friendliness among subjects who 
have cooperated on a task, in contrast to 
those who have competed, has also been 
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zing with 
reported by Gottheil (1955) work iong 
eighth-grade students, by J. D ig a lab- 
(1951) who studied adults atten ie an 
oratory for training in groun: el condi- 
by Deutsch (1960) who varied te ihin 
tions under which reward was attaine 
college classroom groups. 

In a recent study by Solomon goen 
of subjects played a game while eac a 
was in a separate cubicle, unable t0 “itches! 
other, and communicated via electric ni 
In such a situation, too, sabies a part 
ceived a cooperative strategy from utual 
ners (i.e., one promoting marimi ag for 
gain) tended to develop greater Aa on- 
them than did subjects who receive y: 
cooperative strategy. d b 

Negative findings have been sepor ale 
Rosenbaum (1959) who had pairs o min 
undergraduates discuss together for g 
utes their choice of an ideal colle a sof 
end, cooperating pairs (defined in es as 
both members being eligible for O acantl! 
opposed to just one) differed sign 45° 
from competing pairs on a measure 
sumed similarity of personality traits, 
on a social distance scale of desis tra" 
tionship with partner. These findings ' jt is 
dict most of the others reported, nditio”? 
not clear what the responsible ah men” 
might be. In a study by Myers (196 ound w 
bers of competitive rifle teams were ir tea 
increase their esteem ratings of the! ti 
mates more than members of nonco™ 
teams, but in this case the individus kiné 
bers of competitive teams were not a 
against each other, but were working 
against other teams, ersi 

In summary, the hypothesis that P ork” 
will like one another if they have Yuti 
together for common rather than f0" oral 
ally exclusive or individual ends is 8° 
supported, k (1960) 

Democracy. White and Lippitt erim” 
have reported that, in the classic XP gp 
on social climates conducted by Den goor 
his co-workers in 1939, when the boY® “ead? 
were supervised by a “democratic 
there was more friendliness amonei 
than when they were supervise! 7 f 
“autocratic” or “laissez-faire” leade™ po 


es5; 
was, further, greater group minded? 


pairs 


the “gp 
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mutu wat 
Sp ion and more spontaneous sub- 
Steatest cette | representing the 
Yo sociometric d pity iniia- five ai pent 
of the two ex ata were obtained in either 
vestigation experiments comprising the in- 
based E a judgments of friendliness are 
tions, It is aa y on the analysis of conversa- 
studies ome then, that so few other 
test the hy ane designed to specifically 
groups, diffe pothesis that between similar 
members ee only in leadership style, 
other mor emocratic groups will like each 
Ones, e than will members of autocratic 
Ine r 
Were eae data bearing on this question 
of eighth m by Rehage (1951) in a study 
class SPET a social studies classes. In one 
teacher win was done by pupils and the 
e ines in the other it was done by 
ean Ma one. Students in the two classes 
acceptance atched on IQ scores and on an 
Year t a ejectar index. At the end of a 
Wo Pa was no difference between the 
of Dupils = in acceptance-rejection indexes 
Cral grou ut there was a difference in gen- 
Dresent P structure: well-defined subgroups 
the P the beginning tended to persist 
Patterns aa pil class, whereas friendship 
*e other ad been “modified perceptibly” in 
erkowi 
Vant Di (1953) has also reported rele- 
n Ser ation from an investigation involving 
aps ii of seventy-two actual conference 
Hoke ah nae and industry and the 
cg Pants, Le interviewing of individual par- 
nt ader permissiveness in discussion 
an ; t Was found to correlate positively with 
Cludeg 7 Of grou ae ich in- 
he ed a ine p cohesiveness (which in 
ag abers) Bie of personal liking among 
the Ving’ soe in groups characterized 
ee, analys rgent problems” to solve. Fur- 
ty Ups which indicated, however, that in 
Dee: ntent were permissive with respect 
Siti the cohesiveness index correlated 


Vi 


t tant and negatively with percent of 
i ndi n and problem solution by mem- 
kowi aes in the groups studied by 
hao shared, apparently, “an expecta- 
ts oe the designated leader, the 
» is to be the major behavioral 
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igh fink penp and the more this was 
? Si Á 

with their a ee yee he ae 

That there is a clear and simple relati 

ship between democratic participation mA 
consequent liking among group me a 
seems doubtful. It is more likely th E the 
degree to which group members’ role = : 
tions are fulfilled is the more e eal ae 
able in producing satisfaction with the g on 
and with each other. md 


Acceptance by Others 


A number of studies have followed the 
general procedure of manipulating condition: 
so that A believes himself to be liked (or n i 
liked) or accepted to some degree by B oe 
A’s attraction toward B is subsequentl 
measured. In an investigation by ead 
and Secord (1959), for example, subjects 
were given the names of others who would 
most probably like them (as judged by the 
experimenter’s evaluation of personality test 
data). At the first interaction session the 
number of these “probable likers” chosen as 
“team partners” was significantly 
subsequent interaction ses- 
h the subjects got to know 
1 effect disappeared. 
ure, Kelley and 


desired 
above chance. At 
sions (during whic 
one another) this initial 
In variations of this proced 
Shapiro (1954) and Dittes and Kelley (1956) 
found that subjects made aware of positive 
ratings of themselves presumably made by 
their fellow group members differed signifi- 
cantly from others made aware of negative 
ratings in their desire to remain in the group 
or to work with the other members. The de- 
pendent measures used in the earlier study 
included ratings of individual co-workers. 
Simiarly, Dittes (1959), who permitted sub- 


jects to see fictitious ratings of themselves 
hat the more accepted 


by others, found t 

members were significantly more attracted to 
the group qua group and that this was more 
true of persons with low as opposed to high 
self-esteem. 

In a study 0 
Pepitone and 
strated that in 
where the subject 
significantly more 
bers (two confede 


f the consequences of rejection, 
Wilpizeski (1960) demon- 
the no rejection condition, 
believed himself to be 
liked by his group mem- 
rates) than in the rejection 
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condition, he also tended to like his fellow 
members reliably more. Pepinsky, Hemphill, 
and Shevitz (1958) manipulated rejection 
and acceptance by having stooges react nega- 
tively or positively to attempts at leadership 
by subjects. Under the first condition, the 
confederates received a disproportionate 
share of negative sociometric choices while 
under the second, favorable choices were 
randomly distributed between confederates 
and other subjects, and the groups were sig- 
nificantly higher in “morale.” Harvey (1962) 
has shown a relationship between subjects? 
reevaluations of friends and strangers and the 
ratings presumably received from the friends 
and strangers: “agreement or near agree- 
ment of the source’s with subject’s self- 
ratings resulted in subject becoming more 
positive toward the source, particularly the 
stranger source. . . .” Also concerned with 
rejection, Snoek (1962) compared its effects 
when of an invidious and noninvidious na- 
ture, the latter being impersonal and not 
implying negative evaluation of the rejected 
person. Subjects exposed to the latter condi- 
tion showed a greater reduction in attraction 
to the group as measured by the question, 
“Do you want to remain a member?” but the 
reason for rejection did not make a difference 
on a measure of rejected subjects’ desires for 
contact with individual members, The reasons 
for rejection differed not only on the invidi- 
ous-noninvidious dimension, but also in terms 
of a subject’s expectation that he might join 
the group (bridge club). When the rejection 
was invidious it was based on personal quali- 
ties and skill (open to change), but when 
non-invidious it was based on the subject’s 
sex, that is, the other group members pre- 
ferred to have a girl, Utilizing a procedure 
designed to induce not rejection, but simply 
a “fear of rejection” in some groups and not 
in others, de Charms (1957) found a tend- 
ency (nonsignificant) for subjects in the 
former to find their groups less attrac 


f tive, as 
inferred from responses to a questionnaire. 
Some studies have manipulated acceptance 


by varying status within the group. In a 
role-playing situation, Zander and Cohen 
(1955) found that subjects playing high- 
status persons were significantly more at- 
tracted to their groups than those playing 
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low-status persons, and the former also a 
fered from the latter in having percei 
themselves as making a good first impi n 
and having influenced the group. pem 
and Macauley (1961) reported that m 
who believed themselves to have been ele ders 
by other group members as discussion o their 
tended to be more highly attracted to E 
fellow members both as social company 
and work partners if they also believe 
their high status could drop. 

In a questionnaire study of sta 
in a child welfare agency, Jackson 
obtained some evidence that prote: 
personnel selected by co-workers as bel 
“most valuable” in terms of their co? 
tions tended to be more attracted t ey 
groups and its members, especially ! ] 


jects 
cte! 


ff members 


(1959) 
fessional 
ng 


were in high contact with one anot om 
utilizing a natural environment, 

(1956) studied seventeen men, 
strangers, housed together in a © 
Throughout the period of interaction, g% 
was a positive relationship between K zed bY 
others and believing oneself to be ioari! 
them. Similar findings were reporte giu” 
by J. R. P. French (1951) and pase? 
(1952). French found that among th more 
on which friendship choices were ma ie and 
group members was the extent of act? iui, 
expected reciprocation of choice, an 
after investigating about thirty nip ”% 
groups, reported that the “relations ecti” 
tween perception of affect and ĉ ral! 
response has been found to obtain 8°” e 
irrespective of the size of the group: peth 


b 
A orent 


and the age of the subjects. . > < yal at 
A’s perception that he is positively a cle 
by B antedates his liking of B is “a oor 
from these particular studies, but he gte 
trolled experiments which have beeni i 
amply support the conclusion that 4 7 pal 
B if B likes or accepts A. That, 45 5 vik 
see later, one of the consequences © co 
B seems to be a tendency for A tO ds aa 
himself as also being liked by B 1e” f pot 
tional significance to the importance ° 
ality or reciprocation, 
Frustration-Threat pat pa 
tel 
Sherif and Sherif (1953) reported tayt 


g the 
o th 
d ¢ th 
her. AIS 
Nev comb 
nit y 
itor” 
the 
\ 
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Intergroup relations for boys at a summer 
camp was to solidify “in-group belongingness 
~ Solidarity . .. and to strengthen in- 
Made Earlier, Wright (1943) 
Paine porn the same general phenomenon. 
tion hs children were subjected to frustra- 
Tee the experimenter with a consequent 
creas se in cooperative behavior and a de- 
i time spent in conflict. No direct 
Th ae Was obtained of friendship or liking. 
cial es early study, the well-known “so- 
by Wee. investigation by Lewin (cited 
i ae & Lippitt, 1960) and his co-work- 
Were Ps autocratically led, submissive, groups 
Portio ound to surpass all others in the pro- 
wo whi friendly remarks among the boys. 
icon, * studies in which no direct data on 
te ae E liking were reported are also 
Axelr a Lanzetta, Haefner, Langham, and 
that od (1954) and Lanzetta (1955) found 
ing pops of reserve officer trainees work- 
ation one study, under the threat of “evalu- 
time and, in the other, under the stress of 
Ment and negative comments by the experi- 
frienal Were more sociable, cooperative, and 
tasks 4 than groups working on identical 
Myer. ut under no threat. More recently, 
three. (1962) reported that individuals in 
om “Man teams facing “adversity,” that is, 
geta. tion against other teams, pulled to- 
diq ; 22d rated one another higher than 
espig viduals in noncompetitive teams. 
Uceq € the fact that an absent member re- 
absen his team’s score on the day of his 
higher? absent members, too, were rated 
Memb by members of competing groups: 
thems ers of competing groups also perceive’ 
‘am elves as being more accepted by their 
ing Mates than did members of noncompet- 

3 teams, 
thay findings of some investigators indicate 
is inate: Shared threat-attraction relationship 
(195 “enced by other variables. Hamblin 
i e » for example, demonstrated that group 
ignition does not increase under threat or 
S When a likely solution requiring coop- 
Some” ÍS unavailable to the group members. 
Cha 8toups were subjected to arbitrary rule 
not SES while playing a game; others were 
liep le former groups were observed x 
folg l8nificantly from the latter in the 
ing areas of behavior, exhibited after 
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the “crisis”: less helpfulness among members, 
less praise of others, and greater antagonism. 
No data were obtained, unfortunately, on 
groups subjected to threat where a likely 
solution was available. Data reported by 
Mann and Mann (1959) may be interpreted 
as supporting Hamblin’s hypothesis. Class- 
room groups, meeting four times a week for 
one hour over a three-week period, were 
organized as task-oriented study groups to 
discuss assigned lists of reading or as free 
discussion groups. Ratings of the members’ 
desirability as friends increased in the former 
groups and decreased in the latter where, 
according to observers, the subjects were 
frustrated and angered by the indefiniteness 
of their situation. As in Hamblin’s study, 
perhaps these subjects saw “no way out.” 
That the threat shared by individuals 
should be one emanating from an external 
source in order for increased liking of one 
another to follow is suggested by the findings 
of Pepitone and Kleiner (1957). High status 
groups of boys at a summer camp (i.e., win- 
ners of two out of three tournament games) 
were told either that they would probably 
lose the tournament (threat) or that they 
would probably win. Following the threatened 
loss of status there was no change in the 
number of positive choices made among team- 
mates while among those told they would 
probably win there was an increase in positive 
choices. In this study threat was operation- 


d by a prediction of failure made by a 
se ak group but failure would 
ying on the 


aliz 
source external to the 
necessarily result from poor pla. 


art of the boys themselves.  — 
7 It is apparent from the studies cited that 


definition or manipulation of k oy 
from one investigation to the next. It — 
example, been equated with pone oy fe 
ition, with frustration or nes 

‘a and with threat of age In ne 
research (€.8-; Burnstein & Mc ER Ep 
poor evaluation of group performan pe 
task has been used to induce a conditio 

“shared threat.” Such studies, in which ac- 
tual success oF failure constitutes the major 
independent variable, will be discussed sepa- 
rately in a subsequent section. Despite dif- 
ferences in the operations used to create a 
stressful situation, the evidence examined 
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provides a fairly consistent pattern. We sug- 
gest that attraction among individuals will 
be found to increase when their common 
threat stems from an external source (i.e., is 
not a function of their own lack of skill), 
when there exists the possibility that coopera- 
tive behavior may reduce or eliminate the 
threat, and when single individuals cannot 
escape from either the group or the threat. 


Status 


Status Similarity, Thibaut (1950) manipu- 
lated the status of teams of boys by assigning 
differential roles to each team in a series of 
games, and the boys subsequently chose 
teammates for a further game. Those in high- 
status groups (especially peripheral or less 
popular boys) showed a significant increase 
in proportion of own-team choices. Popular 
or central members of low status groups 
which had been unsuccessful in an attempt 
to improve their status also increased their 
proportion of own-team choices. We shall 
consider this latter finding again in our dis- 
cussion of the effects of the success-failure 
variable on interpersonal attraction. Our 
concern here is with the consequences of 
status defined by operations other than suc- 
cess or failure on a task. Further evidence 
appears in a report by Festinger (1953) in 
which he compared a housing project for 
married veteran students (Westgate) with a 
government project outside of a large city 
(Regent Hill). The inhabitants of the two 
projects were described as differing sharply 
in the evaluation of the prestige and desira- 
bility of their respective communities, The 
majority of those living in Regent Hill re- 
sented having to be there (because of the 
postwar housing shortage) and thought of 
their neighbors as lower class. The students 
of Westgate, on the other hand, were living 
among preprofessional peers and had a gen- 
erally positive attitude toward their com- 
munity. It is not surprising, then, that many 
more informal and formal groups were estab- 
lished and friendships developed among 
neighbors in Westgate than in Regent Hill, 
In another field study of industrial work 
groups, Seashore (1954) found a small but 
significant positive correlation between per- 
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ceived status of one’s job and an intel 
group cohesiveness (which did not inclu er- 
measure of intermember liking). Since fing 
ceived status of job was determined by aS job. 
individuals whether they had a ‘good sing 
it is not clear whether the obtained ive 
reflect a relationship between cop an 
and job prestige or between cohesivenes 
job satisfaction. Jo! 
Other relevant research has been cr rom 
groups composed of members centage i 
one another in status, Individuals a 
professional prestige, attending a oo like 
conference, were found to prefer othe indi- 
themselves for friendly interaction gren 
viduals of lower status (Hurwitz, Z iuation 
Hymovitch, 1960). In a laboratory ʻi desira 
where status was manipulated by the nigh 
bility of job performed, subjects m acces 
Status but mobile positions tended to others 
persons at their own level and — ig 
(Kelley, 1951), although there wen i 
nificant differences between conditi gs ) 
choice of best-liked person. Gonen 's 
has reported similar findings to Kel 4 
garding high-status subjects. Jennings ri 
also found that mutual choice was 8 irls 
among the overchosen or high-status ,“(4j- 
(the leaders) on a “live and work wi als? 
terion question. Mutual rejection ie the 
greatest among these girls, suggest! man 
operation of competition or rivalry (P h P” 
1961) under the same conditions whic 
duce attraction, rt pe 
The above studies generally supp?’ pigh 
conclusion that members of the saM? tus 
status group or members of mize : 
Soups occupying high positions will 
like one another, cy {of 
Status Dissimilarity, That the tend en” pets 
high-status persons to be attracted t° pabl” 
of high status (considered above) is P™ ela 
independent of a similarity-attracl? i jod 
tionship is suggested by findings wit sith 
cate a general preference for PiS ovite 
persons, Hurwitz, Zander, and divid" i 
(1960), for example, found that in pose 4 
low in Professional status liked ¢ eat 
higher status more than they lik ic 
other. Data on direction of commi galing 
Support the sociometric findings- ing wil 
Greer, and Gilmore (1955), workin8 


ne on 


50) 
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military men, also found status to be highly 
related to sociometric ratings, and Tagiuri, 
Kogan, and Long (1958) reported that col- 
ege preparatory students tended to choose 
for a roommate), without perceiving them- 
Selves chosen in return, individuals with high 
choice status, while they perceived themselves 
i“ Pee chosen by individuals of lower 
tue In the previously mentioned Kelley 
ti ra (1951), the low-task-status but poten- 
ally mobile subjects exhibited friendly be- 
avior to high-status subjects, as judged 
a an analysis of written messages; this 
tobii a true, however, of low-status, non- 
pe individuals. Cohen’s (1958) replica- 
in a this study obtained similar data and, 
subj dition, found that nonmobile, low-status 
Teens sent significantly more cohesiveness- 
t an ane comments to their own subgroup 
nonm, did mobile, low-status subjects. The 
Perie Obile, low-status subjects may have ex- 
whi nced a condition of common frustration 
ich, p erhaps, was a more powerful variable 
of Status in influencing the development 
tee ae affect toward other group mem- 
S. Findings from another study (Thibaut 
lecken, 1955) suggest that some people 
Need © more prone than others to be influ- 
OF rej by relative status in their acceptance 
iene of persons. Men high in authori- 
e nism, after experiencing aggression at 
ip, 22ds of an instigator, rejected him less 
if pp Status was high and rejected him more 
tarias Status was low than did less authori- 
n men, 

individ, mber of investigations have found that 
influe uals tend to like persons who have 
incly ced them, and it seems reasonable to 
Sion CE Such findings in the present discus- 
Usually the direction of influence is most 
of 7.» from a position of high to a position 
VO status. Whether influence prečedes 
Clear 0N, or vice versa, is, however, no 
in? Since the data have come from studies 
Per a influence has not been manipulated; 
tha, Utter (1954), for example, has reporte 


Within adult discussion groups the 


Bri 
Ti ember, 


the X the perceived influence ofam 


bing Ore desirable were the traits assigned to 
Corral in some groups a significant positive 
Vale: ation was also obtained between the 

nce of a group member with respect tO 
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“most prefer working with” and his perceived 
influence. Earlier, Lippitt, Polansky, and 
Rosen (1952), in a replication of an earlier 
study, found that boys in a summer camp 
situation tended to like those boys to whom 
they also attributed high power (or influ- 
ence potential). Tagiuri and Kogan (1960) 
have reported positive correlations between 
liking of others and feeling influenced by 
them as well as between liking of others and 
feeling influential over them. The latter find- 
ing, together with evidence from Kelley 
(1951), suggests another phenomenon toward 
which additional research might well be di- 
rected. Kelley found that high-status, non- 
mobile subjects, that is, those not threatened 
by loss of status, exhibited as much inter- 
level friendliness as did subjects in the non- 
hierarchic control condition. 

Status Congruence. Adams (1953) found 
that in bomber crews where members were 
in the same status relationship to one another 
on a number of different dimensions, for ex- 
ample, age, education, role prestige (where, 
in other words, status congruency was high), 
there were more friendships than in crews 
where status congruency was low. Similarly, 
in a study of supermarkets by Clark (cited 
by Homans, 1961, pp. 225 ff.), congeniality 
between “ringer” and “bundler” was found to 
vary with status congruence: Congeniality 
was high when the ringer was equal to, or 
higher than, the bundler in both general work 
responsibilities and social position, Exline 
and Ziller (1959) set up congruent and One 
congruent decision-making groups in terms 
of status derived from each member’s ability 
on a task and the worth of each eee 
vote. Congruent groups were rated ee 7 ot 

rs as significantly more congenial. An in 
ors ee con by Raven and French (1958) is 
vestigation Dy dicted that the more 
also relevant. They predi k a 

$ ived as having a legiti- 
a supervisor was DE higher would 
mate right to her position, the hig - 
be the ratings she received from group 


pers on a liking scale. The results, although 
not statistically reliable, were in the predicted 
direction. Judging status along a standard of 
legitimacy may be considered of the same 
nature as comparing status on one dimension 


with that on another. 
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Behavior or Personality Characteristics 


Here we will consider investigations which 
have related the attractiveness of an indi- 
vidual to his specific characteristics, regard- 
less of those of the individual attracted to 
him or the nature of their relationship. 

One class of such studies is represented by 
those in which persons differentiated on the 
basis of personality test scores are compared 
with respect to their likability or the extent 
to which they are chosen by others as friends 
(or on some related criterion). Masling 
et al. (1955), in two separate investigations 
with army men, tested the hypothesis that 
“equalitarians” will receive more sociometric 
nominations than authoritarians because they 
exhibit greater warmth and less hostility. This 
assumption was based on previous research 
by Sanford and by Eager and Smith (cited 
by Masling et al., 1955) in which equali- 
tarians were found to be perceived as warmer 
than authoritarians, and camp counselors 
judged as hostile scored higher on a measure 
of authoritarianism than those not so judged. 
Masling and his co-workers found that a 
small but significant positive relationship 
between equalitarianism and sociometric 
choice did exist where army status was held 
constant. Chapman and Campbell (1957) 
have also reported a small, insignificant cor- 
relation in the same direction: highly authori- 
tarian individuals (as measured by the Cali- 


fornia F Scale) tended not to be chosen as 
desirable teammates, In 


study, Kates (1959) ha 
stimulus persons descri 
The objectively more 


greater power, exhibiting 
being more personally attractive. He was not 
however, judged as being f 
These findings, then, do 
cited above since the difference in authori- 
tarianism between the two stimulus 
was not recognized by the subjects, 

A number of investigators have pursued 
the question of whether individuals with 
relatively positive self-concepts are more 
attractive or better liked than persons with 
less positive self-concepts, McIntyre (1952) 
found no significant difference, in responses 
to a self-attitudes questionnaire devised by 


not contradict those 


persons 
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Phillips, between the most highly and T 
least accepted men within a college samp 
A similar study by Fey (1955), using jg- 
own self-attitudes scale, also obtained e Sal 
nificant relationship between self-accep “a 
and acceptance by others, but subjects nas 
high self-acceptance and low acceptan a 
others tended to be rejected by others. tance 
tive findings with respect to agence 
and acceptance by others have been rep 

by Miyamoto and Dornbusch (ieee 
well as by Reese (1961). In the latter 5 
of elementary school children, evidence 
obtained for a curvilinear relationshiP, | 
is, high acceptance, measured by both anie 
and ranking, was found to be accomp e 
by moderate self-concept scores as Mea 
by a scale devised by Lipsett. 

Still other characteristics have beet. uals. 
to distinguish attractive, likeable ma nics 
Kelley (1950) had students in an eco yhom 
class rate a substitute teacher, abat a 
they had previously received informat P wing 
seven personality-behavior scales, f° ose 
a discussion led by the substitute. peer 
students to whom the substitute par te 
previously described as “warm” eae j t 
rated him more favorably than those p ul- 
he had been described as “cold.” Usin8 and 
tiple criteria to determine acceptance s id 
rejection among dormitory members 952) 
girls’ college, Lemann and Solomon ated 
found that highly accepted girls ae and 
by their peers as “generous, enthusiast! 7 ere 
affectionate, while girls with low status ‘and 
generally rated more stingy, apathetlt? dat? 
cold.” R. L. French (1951) obtained pat 
among naval recruits which indicate ffens? 
men with sick bay and disciplinary A thet! 
records tended to be less acceptable fagit 


idy; 
was 


ting 


found 


peers than were other men; an pre? 
(1952), in comparing 15 maladjuste n ee? 
school boys (ice, boys who had bee” yel 


by the school Psychiatrist) with l ceived 
adjusted boys, found that the latter "° tne 
more positive sociometrie choices frO™ no 
peers than did the former, A similar P a 
enon in a quite different environme! al 
been reported by Speroff and Kerr el mail’ 
Among a group of workers in a y a 
interpersonal desirability was negativan i 

related with number of personal acc! 90) 


With the exception of Kelley’s 
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pe ie investigations mentioned in 
the cual ing paragraph leave ainauswered 
leer 7 of which variable, positive be- 
and ehi ative attractiveness, is antecedent 
ats ena Sonenn This is true, as well, 
constructior by Van Zelst (1951) in which 
desirabilit a who were rated high in 
significantly z others were found to differ 
generally z rom those rated low, in being 
more fave r adjusted to their jobs and 
company a le in their attitudes toward the 
ings report cpa is more apparent in find- 
men who ad by Kipnis (1961): Male fresh- 
est fieis ascribed negative traits to their 
off these A mee than to themselves) broke 
tiod, mar ct within a six-week pe- 
ascribed requently than did those who had 
than t more positive traits to their friends 
P © themselves. 
su con and Sherberg (1957) found that 
volving tit ho read scripts of an incident in- 
cantly hj ig person rated him signifi- 
iked hi gher on a liking scale, that is, dis- 
aving a less, when he was perceived as 
at iene intentions” than when his 
Other st, were believed to be bad. In an- 
udy with Wallace (cited by Pepitone, 
abing ects who failed test items under 
Menter A ne imposed by the experi- 
Who ti him more than did subjects 
Studies Med justifiably. In both of these 
ts is g an individual’s attractiveness to oth- 
e ae to be influenced by his particular 
whether in a given situation, regardless of 
erson „this behavior is characteristic of his 
"h ality, 
i Nae both more enduring traits and situa- 
ess jg cific behavior determine attractive- 
(1 Suggested by the findings of Jennings’ 
iong pe hig esos of interpersonal rela- 
Schoo] rie: four hundred girls in a training 
Chosen be delinquents. Girls who were over- 
anq Wo. es their peers on the criterion of “live 
5 Ps with” are described by Jennings 
oup itive “to the elements of the total 
te Fas hie to a very much greater ex- 
an the average member”; they con- 


i 


trip 

u 

ed constructively to the development 

high Positive experience of others, displayed 
ded to internalize 


Milas it de corps and ten 
So iy worries, Compared with the other 
Bitls? ess aggressive, more cooperative, and 

> the highly chosen were less quarrel- 
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morg initiative-taking. Similar findings have 
peen reported by other investigators. For ex- 
ample, with respect to the variable of senei- 
tivity, Gage and Suci (1951) found that 
high-school teachers who were more accurate 
in predicting their students’ opinions on 
school related matters were also those who 
elicited positive affect from their students. 
Popularity has been related to accuracy i 
the perception of others’ interpersonal pref- 
erences within infantry rifle squads (Greer 
Galanter, & Nordlie, 1954), and to accuracy 
in the perception of others’ popularity and 
group dimensions within classroom groups 
(Bugental & Lehner, 1958). Other character- 
istics of liked persons, similar to those re- 
ported by Jennings, are also supported by 
subsequent investigations. Haythorn (1953) 
found, among naval reserve officer trainees 
that those men who were most often chosen 
by their peers on a number of sociometric 
criteria were those who most facilitated the 
effective functioning of their groups, who 
were rated cooperative, efficient, and insight- 
ful, and who scored high on personality 
measures of maturity, adaptability, and ac- 
ceptance of others. Positive relationships have 
also been reported between likability and 
number of interactions initiated in discussion 
(Norfleet, 1948), judged value of contribu- 
tions to the group (Theodorson, 1957), and 
frequency of participation (Shelley, 1960). 
In the Theodorson study, the cited rela- 
tionship was limited to certain groups, those 
high on an index of cohesiveness. Other lim- 
iting conditions are suggested by an investi- 
gation by Bales (cited by Riecken & Ho- 
mans, 1954). Individuals in problem-solving 
groups rank ordered each other on four cri- 
teria at the end of each of several meetings. 
Scores on «contributed best ideas to solve 
problem” and “did most to guide discussion 
correlated highly with frequency of both ini- 
tiation and receipt of interaction, but “ike” 
scores correlated highly with the other scores 
only at the first meeting and not at subse- 
quent ones. A man who had ranked high at 
the end of the first meeting on both the “best 
ideas” and the “ike” scales tended to keep 
the latter status and lose the former by the 
end of the second meeting. Thus, in certain 
kinds of groups, specification of which is 
still unclear, “task master” and “good fel- 
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low” roles (Riecken & Homans, 1954) may 
be antithetical. In another investigation, 
Bales (cited by Homans, 1961) found that 
group members tended to dislike those lead- 
ers who talked more than they were talked 
to while leaders “whose ratio of receipts to 
initiations was high were apt to be chosen 
highly for both guidance and liking [Ho- 
mans, p. 306].” Related to this general ques- 
tion, Jennings (1950, p. 272) noted that al- 
though many of the choices made on the 
“live or work with” criterion were the same 
as those made for leisure time companions, 
there were also differences; and the founda- 
tions for the latter choices seemed to be 
“based more exclusively in warmth and good 
feeling, as ends in themselves.” 

A large number of investigations concerned 
with individual personality and small-group 
performance (reported between 1900 and 
1957) have been reviewed by R. D. Mann 
(1959). He concluded that the data on 
popularity support a positive relationship 
with “extroversion, intelligence, adjustment, 
and conservatism.” 

It is apparent that the attractiveness of an 
individual is enhanced by certain behavior 
which may be either temporary and situation 
bound or enduring and central to his person- 
ality. This behavior, as seen from the research 
summarized above, can be described in terms 
of warmth, equalitarianism, good adjustment, 
sensitivity, helpfulness, and so on, Such be- 
havior might be further describe 
erally, as that which should result in interac- 


tion that would have pleasant, as opposed to 
unpleasant, consequences, 


d, more gen- 


Similarity 


No other variable considered thus far has 
been the object of more investigation than 
this one. One of the major problems, how- 
ever, is that the question of whether mutual 
attraction is a consequence or an antecedent 
of similarity is not always answered, although 
there have been a few controlled studies di- 
rected specifically at clarifying the direction 
of the relationship. A related problem in this 
area of research concerns the measurement of 
similarity. B’s similarity to A is sometimes 
objectively determined; other times A simply 
perceives B as similar to himself (and perhaps 
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erroneously). In the latter case, does A ea 
ceive B as similar to himself because he li e 
him, or is he attracted to him perme, 
believes that B is similar? It is probable be 
both relationships hold, but here we sha Hs 
concerned with the latter. Data bearing we 
the former will be considered later when n 
examine findings relevant to the conseque 
of attraction between persons. ; 
Background (Race, Ethnicity, Oceupait j 
Age, etc.), In a laboratory group comP irls, 
half of Jewish girls and half of Catholic fa 
the girls in each category split thei fore 
equally between Jews and Catholics ffilia- 
members were identified by religious ° is 
tion. After identification, the Jewish fio 
continued to split their votes, but the aroup 
lic girls overchose within their sube a- 
(Festinger, 1950b), In a subsequent S e 
tion where religious identification ° 
voter was not possible, both Jew!S? neir 
Catholic girls voted in the direction 0 sis 
respective subgroups. Choice on the ba 
similar religion has also been reporte oys 
Goodnow and Tagiuri (1952) among © ng 
attending a liberal preparatory scho? perge 
Zaleznik, Christensen, and Roethlis found 
(cited by Homans, 1961, pp. 214 f a in 
that in a Boston industrial plant gas: t 
the Irish majority were much more lik of £ 
be approved and accepted than thos? vest 
different ethnic background, Other tio’ 
gators have reported similarity of occuP ong 


to be a basis for friendship choice pi P. 
adult members of training groups duck” 
French, 1951), and similarity in the js {0% 
tion and salary of fathers to be 4 ba mone 
not desiring a change in roommate 
freshman girls (Broxton, 1962). tbe 
In a field study by Seashore (1954) oas 


cohesiveness of industrial work groups pii 
ured by responses to questions 4 ve 
group as a whole, was found to be P%® tag 
related to within-group age variance, io 
homogeneity, and not at all related 
larity in educational level of membe" gg 
hesiveness did correlate positively ge 7 
nificantly, however, with the percent ov 
members in the group who had H ny* l 
three years of service with the compa 130 i 
1S Not surprising to find that the ee # 
of similarity on particular attribut 
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vary in di 
not a ee group situations, or, in other 
given chara homogeneity with respect to a 
friendship nang will not be related to 
arsfeld and a in all groups. Thus, Laz- 
that within e i “ae (1954) have reported 
Craftown es of two housing communities, 
Pennsylvania, ew Jersey and Hilltown in 
the degree of similari 
re milarity . . . i Š 
mort og Ne mnt i tag : a, oe 
those of the itation of intimate friendships among 
selectivity ene race and sex, to entirely negligible 
erms of educational status [p. 22]. 


Furth - 
factor in Belle was an important selective 
where simil illtown, but not in Craftown, 
dence, arity in political values took prece- 

Wh 
Make pte be operating in the tendency to 
to fice an 9 choices for individuals similar 
Suggested i in background characteristics is 
Ported þ 4 the results of two studies re- 
first, Won yrne and Wong (1962). In the 
ings shout” had subjects indicate their feel- 
ackgroun Po be pd on whom only minimal 
ighly prej ata were provided. As expected, 
More ne ejudiced white subjects responded 
pe ene to a Negro than to a white 
n “i while subjects low in prejudice did 
that ide oon study tested the hypothesis 
a greater A prejudiced white subjects assume 
Ween the egree of attitude dissimilarity be- 
an bet mselves and an unknown Negro 
White n themselves and an unknown 
dice do nr whereas subjects low in preju- 
Of iether 3 The hypothesis was supported. 
de P is the analysis of actual 
‘erences oe Ri there were not significant 
Neato eb etween a sample of white and 
ate dietetics on 26 items, indicating that 
Jects Perc larity highly prejudiced white sub- 
Broes yA eived between themselves and Ne- 
testion na function of their prejudice. A 
gaiis e raised by these findings which 
Os Hong. aug If we tend to believe 
client b ns who are like ourselves in some 
milar go ckground characteristic are also 
pae that us in attitudes, will it also be the 
p of hi when A perceives B t 
Seimi a tudes that he wi 
tHitude to himself in backgroun 
Ong (4 s. In a further study by 
962), white subjects were p 


attitu 


o share a num- 
1] assume B to 
d? 

Byrne and 
rovided 


with k i 
ee a information and the 
‘desi 4th sessed oe Taa 
un g es fr 
oe oo either a WHE Ge pe 
TR fg oon neni their feeling: 
ing one of seven pe Beeps F a 
ae led to believe that the ae 
ak s were in complete agreement with 
t own in direction, while the oth i 
jects were led to believe that the ein os 
attitudes were opposite to theirs. Peional 
feelings toward the stranger were T 
significantly influenced only by attitude a z 
my and not by race of the stranger, eile 
e subject’s degree of prejudice. These d A 
and other A findings reported by Rok ih 
(1960) indicate that belief congruence Ten 
under certain conditions, be more im) orian 
than race in determining aterpe : a 
erence. In this connection, Triandis (1961) 
has pointed out that whereas belief Be 
may be more important than ethnic PPE 
with respect to friendship choices, this is me 
the case with respect to relationships din 


{ ; in- 
volving relatively large social distance, such 
as acceptance : 


of a person as a neighb 

A r: ighbor or a 

a student in one’s university.” ‘ 
Similarity in interest was found to be more 


ith clique membership 


closely associated wi 
than either grade or neighborhood in a study 
of adolescents in a California high school 


(Marks, 1959), although the interests which 
were relatively homogeneous within cliques 
differed with the sex of the group. From this 
study it is not possible to determine clearly 
whether common interests preceded clique 
formation Or whether membership in the 
same group gave rise to shared interests. In 
some other investigations, however, there is 
less ambiguity regarding the independent and 

Broxton (1962), for 


dependent variables. 
example, found that freshman women WO, a 


the end of five weeks, did not desire to change 
more similar to their room- 
mates in attitudes, for example, 

oking, than were 


toward drin 
women who did want to change. 
A considerable amount of relevant data 


has been reported by Newcomb (1958, 1961) 
from a large-scale study of university men, 
initially strangers, who were housed together 
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in a dormitory. Measures of attitudes toward 
100 objects, as diverse as President Eisen- 
hower, house rules, and classical music, were 
obtained prior to the boys’ acquaintance with 
one another. For the first of two groups of 
seventeen students, “it turned out that clus- 
ters of most closely agreeing individuals, be- 
fore acquaintance, were those most likely 
later to constitute cohesive cliques [New- 
comb, 1958]. The preacquaintance indices 
of actual agreement did not predict to early 
sociometric choices, but to those made after 
the first two or three weeks, after, presum- 
ably, the men had had a chance to learn one 
another’s orientations to various issues. The 
attitudes of the men remained, in general, 
relatively stable while their “high attraction 
preferences tended to change in favor of in- 
dividuals with whom they were more closely 
in agreement [Newcomb, 1961, p. 254].” 
During the early weeks, judgments made by 
the men about each other’s attitudes were 
related to their feelings of attraction, but 
their judgments tended to be not very accu- 
rate. “With the acquisition of new informa- 
tion about each other, estimates of others’ 
orientations tended to become more accurate, 
and the preponderance of changes in attrac- 
tion were thus influenced by considerations 
of reality [1961, p. 255].” 
Byrne (1961b, 1961c) has supplied addi- 
tional data relating attitude-scale agreement 
with liking and has also reported (1961c) 
that the effect attitude similarity has on at- 
traction varies with strength of affiliation 
motivation. Subjects high in need for affili- 
ation responded to a dissimilar stranger with 
dislike but subjects low in affiliation need 
responded with indifference, It can be ex- 
pected that attitude similarity will not be 
given the same weight by all persons when 
they judge the attractiveness or likability of 
another individual, and further, that similar- 
ity on some attitudes will be more important 
for some people, that is, have more influence 
on such judgments, than similarity on others, 
Supporting data with respect to the latter 
expectation have been reported by Newcomb 
(1961), but both hypotheses are in need of 
further verification and refinement. 
Some studies have shown a relationship 
between attraction and agreement on a vari- 
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ety of relatively specific issues, as distinct 
from general attitudes. Festinger, Schachter, 
and Back (1950) found that in a student 
housing project consisting of U-shaped courts 
there was relative homogeneity within courts 
with respect to attitudes toward a tenants 
organization, and that those who deviate 
from the dominant attitude tended to be rè- 
jected on the sociometric question, “Who do 
you see most of, socially?” In another part a 
the housing area, however, consisting of iE 
of two-story buildings, the degree of hom t 
geneity regarding attitudes toward the = 
ants’ organization was even greater tha 


ogy sé A 0 
within the courts, but deviates were, ne 
rejected. In a theoretical paper, er 
y by ¥ 
(1954) reported data from a study Ravel 


inger, Gerard, Hymovitch, Kelley, and - 


? he 
(1952) which had not been given i 
original research paper. Groups of SU va 


had studied a labor dispute and then ¢ ri- 
ated the union’s behavior, After the ar 
ment, the subjects had been asked how W% 
they liked the other persons in their groun, 
“In each of the eight different experime cs 
conditions those who thought that the Ot a 
held divergent opinions were less — 
to the group [Festinger, 1954].” eae 
(1954) found that for some Air Force P 
sonnel interpersonal attraction was ass0¢!® ce 
with common satisfaction with the Air F” i 
or agreement with respect to personal thet 
mitment to Air Force goals, while for ° red 
men attraction was associated with set 
dissatisfaction with the air site an vind 
their jobs. In Hilltown, one of the oe 
communities discussed by Lazarsfel "ho 
Merton (1954), white residents tend? me 
overselect as friends those with the ani 
racial attitudes.” Newcomb (1953), of 
larly, has reported that in two replicatio”® y- 
a study in a small college community, iti- 
dents at each extreme with respect tO pe ike 
cal attitudes chose, as friends, thos? 540) 
themselves. Altman and McGinnies js 
have related similarity in ethnocenth to 
among group members to their attractio” w 
one another. Six-man groups, of varied ¢ ale 
Position with respect to the California e 
Scores of the members, viewed an? jes 
cussed a film dealing with ethnic minor tS 
and prejudice. Low-ethnocentric (E) si 
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Were found to be more apt to choose other 
TE subjects on an attraction question 
= high-E subjects divided their choices 
ut equally between highs and lows. High- 
er Fcis; however, were not as accurate as 
id Ow-E subjects in identifying the atti- 
es of other group members. 
ong Study (Gage & Exline, 1953) has pro- 
twee negative findings: no association be- 
“tn leisure-time, sociometric ratings of 
judged members and either their actual or 
Opini Similarity in response to a series of 
°n items regarding group processes and 
D of human relations. Perhaps opin- 
importa these questions were not especially 
erent ae the group members and a dif- 
erent Set of items might have yielded dif- 
Best a: Newcomb’s data (1961) sug- 
telateq at agreement on some issues is more 
5 to attraction than agreement on oth- 
the ia that these are the issues about which 
ro viduals are more concerned. _ š 
tese ative data within a different kind 0 
arg Ch setting have been reported by Ger- 
f situa 26D. Subjects in a modified Asch 
the sy n ranked highest, on a liking scale, 
line | Ject who always agreed with them in 
subje tgments, and they ranked lowest the 
tion St who never agreed with them. In addi- 
lisanne 8toup member who switched from 
Ory ment to agreement either very early 
hihan late in the judgment series was ranked 
the a an one who switched sómewhere in 
Asch iddle. In another investigation of an 
Lott ({oulP-Pressure situation, Malof and 
On likin 2)» though they obtained no data 
Congo; found that the support of a Negro 
judg “Tate who switched to making correct 
White ents was accepted by highly prejudiced 
Ment Subjects (exposed to erroneous JU lg- 
às it Y a White majority) almost as readily 
Preju Ys accepted by white subjects low m 
Ce. Acceptance of support was defined 
“ange, on the part of the critical sub- 
incorrect to correct judgments fol- 
Such a change by the confederate. 
ot Steement between Individuals A and B, 
Not Ssue X, but on their evaluation 0 
tor & individu l, C, has also been shown 
Ot ont positively to A’ d B’s evaluations 
the a Positively to A’s an ET 
iS a nother, Newcomb (1956) ou on 
ationship increases over time amons 


273: 


group of students living in the same dormi- 
tory, and Davol (1959) reported that among 
men in a Veterans’ Administration domicile 
those who either liked or disliked another 
man in common tended to like each other, 

If B’s agreement with A on C enhances A’s 
attraction to B, we can further suppose that 
B’s agreement with A on A, that is, agree- 
ment with A’s self-evaluation, should have 
the same result. Newcomb (1956) found this 
to be the case with respect to agreement on 
both positive and negative traits. Backman 
and Secord (1962), similarly, found that 
girls in a sorority house both like and interact 
most with those girls whom they believe see 
them as they see themselves (with regard to 
selected personality traits), and they like 
least and interact least with those girls whom 
they believe to have images of themselves at 
variance with their own. These data were un- 
related to differences in the social desirability 
of the traits. 

Deutsch and Solomon (1959) investigated 
the consequences of agreement on evaluation 
of task performance. Female telephone-com- 
pany employees were led to believe that they 
had performed poorly or well on a task and 
then received either an unfavorable or a 
favorable note from one member of their 
group. Although there was a clear tendency 
for subjects to respond more positively toa 
favorable note writer under all conditions, 
when this was held constant it was found 
that a note writer was more favorably evalu- 
ated when her opinion of a subject’s task 
as consistent with the subject's 
he note writer’s opinion was 
inconsistent with the subject's. A gens 
study (Howard & Berkowitz, any n 
shown that agreement among persons 1n = 
evaluation of an individual’s pegiormanoa 80 
affects his attraction to them. Subjects a ose 
task performance was judged by raa e 
ers tended to be less desirous of working nye 
the one whose judgment deviated from: a 
of the other three, regardless of whether the 
deviant evaluation was positive or negative. 

A final group of studies relevant to the 
present discussion are those dealing with the 
relationship between acceptance of A by B 
and A’s consequent attraction to B which we 
reviewed earlier. We assume that, in most 


performance W: 
own than when t 
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cases, acceptance by another is congruent 
with own feelings toward ourselves, that is, 
that self-attitudes tend to be favorable. 
Values. The Allport-Vernon Study of Val- 
ues (or one of its revisions) has been em- 
ployed in a number of investigations to com- 
pare friends with nonfriends or persons who 
like each other more with persons who like 
each other less on similarity of value profiles. 
Newcomb (1956), for example, has reported 
a significant relationship between actual simi- 
larity in Allport-Vernon scores and attraction 
and also between estimates of agreement 
(i.e. subjects’ perceptions of how others 
would rank order the six values in the scale 
compared with their own rankings) and lik- 
ing (Newcomb, 1958). This latter relation- 
ship was greater after fourteen weeks of in- 
teraction among men in a dormitory than it 
was after two weeks. A. J. Smith (1957), 
concerned with the causal direction of this 
relationship, gave to each of his subjects two 
partially completed Allport-Vernon booklets, 
one containing responses identical to those 
previously made by the subject and the other 
containing dissimilar responses, Subjects then 
rated each of the individuals, whose Allport- 
Vernon booklets they had presumably seen, 
on their desirability as social companions and 
as work partners. Acceptance on both meas- 
ures was significantly greater for 


thetical person with similar values, Reilly, 
Commins, and Stefic (1960) have reported a 
slight tendency in thi 


e direction of greater 
similarity in value scores between friends 
than between nonfriends, In still another in- 
vestigation utilizing the Study of Values, 
Jones and Dougherty (1959) found that 
subjects with high political scores evaluated 
another politically oriented person favorably, 
especially when competitive interaction with 
him was anticipated. When no interaction was 
anticipated, subjects with a high political 
value score chose as “liking best” a political 
rather than an aesthetic person. When, how- 
ever, scores on Christie’s Mach IV Scale were 
used as a measure of political 
trary findings were obtained. 
high on the Mach Scale, who can bi 
as manifesting a strong need to 
the social environment, 
others with a similar orie 


the hypo- 


values, con- 
Individuals 
e described 
manipulate 
tended to devalue 
ntation. These find- 
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$ low 
ings anticipate others to be desl I a 
which indicate that either similarity function 
plementarity between persons may another, 
to increase their attraction to one haractel- 
depending upon the nature of the =e upon 
istic (value or personality trait) Pa con- 
a number of other, as yet unspeciier; 
ditions. al 
A nonsignificant correlation Se p 
homogeneity and the ratio of a | natural 
group choices made by members Eisman 
college groups was reported Ra huis an 
(1959) and also by Ramuz-Niet the Bis 
van Bergen (1960) who ae r invest 
man study in Amsterdam. The late 0 


Jue 


L * 
or ie 
tered? 


friend. Significant Positive correlate se 
obtained between the ideal person so ” 
ings made by Pairs of friends a” 
tween the ratings of ideal and of 
the same individuals, One further st" 
Pertinent to the present discussion 
and Kelley (1959) have suggested ense) pal 
costly (in terms of effort and ox? aod BA 
maintain friendships over a distance “bi 
such relationshi 

pected to show 
ity.” 
cited 


seo 
piba 
pat i (0 


ý 
ps would, therefor’, sith) 
relatively high valu sis, A 
In support of this hypoth agms 
data (p. 41) obtained by WÍ 


ed sub 
ect; 
Vok 
acteq 
N whil 
Si 
ag 
Onal; 
ive + 
Wymor 
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a to the effect that greater agreement 
Ghd ng between a sample of individuals 
Phila. friends residing outside their im- 
individu tye than between the same 
eae and their friends living close by. 
stitute eais Personality Traits. What con- 
Te S e patibility has not, thus far, been 
ure A efined. Within the empirical litera- 
ings g are numerous contradictory find- 
simpl ut these apparent contradictions may 
require reflect the fact that compatibility 
and Similarity in certain characteristics 
Words, ome lementarity in others. In other 
clusivs ¢omPatibility seems not to be an ex- 
ta unction of one variable or the other, 
ce ee function of both. 7 
ti e studies have shown a simple positive 
Onship between attraction and similar- 
of objectively measured personality 
at Lindzey and Urdan (1954) reported 
together’ of students who chose to room 
Measure, were more alike on personality 
Nother than were those who rejected one 
Wards? po milariy, Izard (1960a), using Ed- 
Pairs of e oral Preference Schedule, found 
M Dro 1 tiends to be significantly more alike 
Pairs e scores than randomly established 
Correlation’ also obtained significant positive 
yan oad between friends, but not between 
ition; Pairs on the specific traits of exhi- 
Sm, deference, and endurance. In an- 
investigation Izard (1960b) demon- 
that the similarity in personality pro- 
ai etween friends exists prior to their 
tance, 


Tela: 


traits 


Dro mil 


dividual and those of the other must 
€ broadly defined. Winch (1955) and 
tsanes, and Ktsanes (1955), for 
» who compared husband-wife pairs 
domly matched men and women on 
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need patterns (as measured by the Thematic 
Apperception Test and interviews) found the 
existence of complementarity among the 
married pairs, especially with respect to the 
assertive-receptive dimension. In another re- 
search context, Gross (1956), studying 
informal groups within an Air Force popu- 
lation, reported that some groups were 
composed of men with dissimilar character- 
istics and that in these “symbiotic” groups 
each man seemed to have something to offer 
the other; other groups (consensual) were 
made up of men with similar characteristics 
who shared a goal or a set of values. Schutz 
(1958, 1961) has more explicitly defined and 
investigated these two kinds of compatible 
groups, which he called reciprocal when there 
was complementarity of three basic inter- 
personal needs, and interchange when there 
was similarity in interpersonal needs. He has 
shown that such groups tend to be productive 
and cohesive and that their members tend 
to like one another and to desire continued 
contact. 

Negative evidence with respect to both 
similarity and complementarity has also been 
reported, Thus, Bowerman and Day (1956), 
using Edwards’ Personal Preference Schedule, 
found no support for either homogamy or 
complementarity in the needs of 60 college 
couples, regularly dating or engaged; and 
Hoffman (1958) found no greater inter- 
member preferences within groups that were 
homogeneous with respect to profiles on the 
Guilford-Zimmerman Temperament Survey 
than within heterogeneous groups. A con- 
founding variable in the latter study, how- 
ever, was the agar among groups in suc- 

roblem solution. ` 
ace also been shown that under a 

circumstances a person may choose to i. 
on a particular task with a dissimilar fe ah 
Lerner and Becker (1962) demonstrated hat 
similar person is preferred for interaction 

i mutual-gain or noncompetitive situation, 
wile a dissimilar person is preferred for 
; etition. No data were gathered on 
EE but the results seem to suggest that 
the similar individual is the more likable 
fer to share rewards with him. 


since we pre ae 
A related investigation was conducted by 
Israel in 1956 (cited by Homans, 1961). 
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Each subject saw both his score, on a series 
of logic tests, and that of the other members 
of his group. Each subject believed his score 
to be average while those of his fellow mem- 
bers varied from high to low. Subjects were 
then asked to rank order the others in his 
group with respect to their continuance in the 
group for further solution of logical problems. 
Under a condition where reward would be 
based on the group’s total score, choices 
tended to go to high scorers; under a condi- 
tion where reward would be based on score 
homogeneity among group members, choices 
tended to go to other average scorers; and 
under a condition where reward would depend 
on individual performance, choices tended to 
be directed toward low scorers. Again, these 
results do not indicate liking, but suggest, 
simply, that work-partner preferences will be 
made in such a manner as to maximize per- 
sonal gain, although it was also the case that, 
overall, high scorers were most preferred. 

A number of studies have been primarily 
concerned with the consequences for inter- 
personal liking of similarity not in the actual 
personality traits of individuals, as measured 
by some relatively objective instrument, but 
in the traits which they judge each other to 
have. Most notably identified with such re- 
search on “assumed similarity” is Fiedler, In 
an early investigation (Fiedler, Warrington, 
& Blaisdell, 1952), each of a group of fra- 
ternity members was asked to describe, by 
Q-sorting 76 Personality items, himself and 
his ideal self, and to predict how his best- 
liked and least-liked fellow members would 
describe themselves, The results indicated 
that best-liked persons were perceived as 
more similar to both self and ideal self than 
were least-liked persons, No significant dif- 
ference in similarity was found, however, 
between actual self-descriptions made by 
subjects and best-liked choices, and self- 
descriptions made by subjects and least- 
liked choices. The results of other studies 
have also suggested that assumed similarity 
with a liked person may be greater than 
actual similarity. Preston, Peltz, Mudd, and 
Froscher (1952), for example, found that 
husbands and wives tend to rate each other 
on specific traits in the same way they rate 
themselves and that this tendency is greater 
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for happy than for unhappy couples i 
correlations between self-ratings 1960) ob- 
gible. Similarly, Reilly et al. Si e at 
tained no support for either os a mit 
complementarity in the actual nee awards 
tual friends, as measured by segs actual 
test, but among 15 correlations of lity needs 
scores of subjects on single personal Y iends, 
and the scores they predicted for their 
10 were significant and positive. hich sup 
Still other data can be cited w ‘el simi- 
port a relationship between as 
larity and attraction. Lundy, Ka 
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Study of changes in  self-concepts; or 
that a sample of male freshmen P 

smaller differences in personality ode! 
themselves and those whom they CM” tacts 
a variety of friendly, interpersonal © 
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least, 


a erson? 
In some Investigations, the P 
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vidual members have pe 
termined by means of group juee 
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groups, Maisonneuve (1954) foun 
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(1960). They asked each subject, following 
interaction with others on a task, to predict 
how the majority of the members in his 
Stoup would respond to a 30-item person- 
ality questionnaire, and also to respond to it 
'mself, When only subjects of no prior ac- 
(uaintance were considered, those who indi- 
a high desire to get to know their 
L OW group members differed from those 
ge low desire in perceiving the majority 
the group as significantly more like 
themselves. 
The findings reviewed above, relevant to 
e relationship between assumed or judged 
Personality of others and attraction, do not, 
A the most part, provide a satisfactory 
Swer to the question of causal direction, 
at is, whether A’s perception of similarity 
tween himself and B disposes him to find 
lap ie, or whether A tends to see in a 
Ae sle: B personality characteristics which 
sumag ett with his own. We have as- 
Tevjen from the context of the investigations 
lewed in this section, that they bear on 
ormer relationship; later on we will con- 
Others which seem more clearly relevant 
the latter, 
‘hen we put all the evidence together, 
can be little doubt that individuals tend 
Prefer friendly associations with others 
tite are compatible to themselves in inter- 
, values or personality. The major prob- 
“be In this area is specification of what 
Stitutes compatibility in situations of 
"ying characteristics. 


5 
“ccess-Reward 


Shared Success or Failure. Typically, data 
ap cerned with this variable have been ob- 
raed from experiments in which the success 
t failure of small groups, working on a single 
| = or series of problems, has been manipu- 
p °c. Utilizing such a design, Shelley (1954) 
“nd that members of successful groups had 

Be Ote favorable attitude toward their 
th UP than did members of groups experi- 
ei g failure. Steiner and Dodge (1956) in 
terpo Panipulated success and failure by in- 
Seng ring, in one condition, with messages 
Vig ,.2¥ group members to one another 
(‘« lights and buttons. In this condition 
“ceptual inaccuracy”), incorrect mes- 
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there 
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sages were received, while in another condi- 
tion there was no interference and hence 
“accuracy.” When no specific rules had been 
provided by the experimenter for members 
to follow in performing their task (design 
reproduction on a checkerboard), the former 
condition was significantly associated with 
less task efficiency, with more criticism of the 
group, and with greater rejection of own 
group members for a new task. Working with 
groups of second- and fourth-grade children, 
Heber and Heber (1957) gave some groups 
a high score on an arithmetic test, some a 
low score, and some no score and then mea- 
sured changes in the mean ratings group 
members gave one another on a social dis- 
tance scale. Under the low score condition, 
the ratings decreased; under the success or 
neutral condition, they increased, with the 
effect showing greatest permanence after 
success. Pepitone and Kleiner (1957) intro- 
duced the threat of losing a game tournament 
to high-status (winning) teams. Where this 
threat was negligible, intermember attraction 
increased significantly more than within high- 
threat teams. Between low-status (losing) 
teams told they would probably win and 
those told they would probably lose, how- 
ever, there was no difference in subsequent 
in-group choices. It is unfortunate that there 
are no data reported on the level of inter- 
personal attraction within the winning and 
losing teams after they had experienced 
success or failure and prior to the introduc- 
tion of other variables. 

In a study of attraction and personality 
similarity in which group success was not 
manipulated, Hoffman (1958) found that in 
homogeneous groups successful problem 
solving by the group was positively corre- 
lated with intermember attraction. More re- 
cently, Zander and Havelin (1960) investi- 
gated interpersonal liking as a function of 
variation in the task competence of trios 
within larger nine-man groups. Subjects in 
the most competent trios made significantly 
more work-partner choices and fewer rejec- 
tions within their own subgroup than did 
subjects in the poorest trios. Similar findings 
obtained on a scale of general attitude toward 
the subgroup. (The perception of similarity 
among subgroup members on characteristics 
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other than task competence was also ma- 
nipulated and found to have a greater effect 
on the peer choices and group evaluations of 
subjects in the poor than in the successful 
trios.) 

Both Wilson and Miller (1961) and Myers 
(1962) studied the effects of success in 
competitive situations and utilized, as mea- 
sures of liking, personality ratings made by 
each member of every other member. In the 
first-mentioned investigation, winning was 
associated with a larger positive shift in the 
ratings given to their own teammates than 
was losing; in the second study, high-success 
teams increased significantly in esteem for 
their teammates, while esteem for teammates 
diminished in low-success teams. Phillips and 
D’Amico (1956) had earlier reported that 
within groups in which members competed 
against each other, where rewards were evenly 
distributed, there was either no change or an 
increment in within-group choices on a socio- 
metric question, in contrast to groups in 
which rewards were not evenly distributed, 
where within-group choices decreased. 

Both Deutsch (1959) and Zander, Stot- 
land, and Wolfe (1960) have reported data 
indicating a positive relationship between 
group success and the attractiveness of the 
group as a whole; in the latter in 
however, responses to a scale m 
traction to members of the gr 
differentiate between subjects in failing and 
successful groups, There have been other 
reports of equivocal or negative findings. 
Thibaut (1950) found that groups of boys 
who tried to obtain better treatment from 
the experimenter and failed significantly in- 
creased in cohesiveness (as measured by pro- 
portion of own group choices), while groups 
of boys who were successful in improving 
their status did not. This difference in height- 
ened attractiveness of own group members 
held, however, only between the central 
(more popular) members and not between 
the peripheral members of unsuccessful and 
successful groups. In a number of other 
studies the experience of common failure has 
been shown to influence the choice of one fail- 
ing subject for another on a variety of criteria, 
Shaw and Gilchrist (1955), 
found that after a sequence of 


vestigation, 
easuring at- 
oup did not 


for example, 
task failures, 
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iendl: 
subjects chose among themselves for ee 
association more than they chose suce ever 
individuals. The failing subjects, bowel 
had experienced interaction only wm (1987) 
other. Berkowitz, Levy, and eager task 
found that subjects who were high ralia 
motivated and received unfavorable € ently 
tions of their task performance si ae 
rated each other as high in aiar anion 
(on both work partner and social cone fete 
criteria) as did highly motivated j Burn- 
who received favorable evaluations. ups 
stein and McRae (1962) studied groups 4, 
four white subjects and one Negro Caper: 
erate in a situation where group ™é muni- 
separated by vertical partitions, CO™ solve 
cated by means of written messages iie 
logical problems cooperatively. no highly 
evaluation of a group’s performance, Neg!’ 
prejudiced white subjects rated the juation 
higher on a liking scale when the eva white 
was poor than when it was good. Formante? 
subjects low in prejudice, the perfor” ir 
evaluations made no difference m cal? 
rating of one another on the liking 
under the different conditions. ure by 
Is the experience of common fai E red 
members of a group interpretable aS ourc? 
threat or stress emanating from a lpi 
external to the group? It would be ethe" 
relative to this question, to know W peit 
subjects in unsuccessful groups wew 
common failure as arbitrarily impos” ack 
the experimenter or as due to their OW” 
of ability. In the former case, but not jo? 
latter, heightened interpersonal ath 
would be expected. Pepitone and * 
(cited by Pepitone, 1958) have show. 
when subjects consider their failure 4° gat 
or unjustifiable they dislike the invest’? pe- 
more than otherwise, but no data W° aber 
Ported on attraction to other group mer and 
We may make use of the Pepito? pat 
Wallace findings, however, in inferring, hot 
arbitrariness of evaluation probably A and 
Operate as a variable in the Burnstei ple 
McRae study since there were no "° 
differences between the favorably and f 
evaluated subjects in their ratings gine? 
tests’ “fairness” or in how much they, 
the experimenter. It is possible th@ cif? 
Burnstein and McRae findings are 5 
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fe ations which prejudiced white persons 
attractiy, Negroes, that is, Negroes are more 
Te ene! hey such white persons when they 
PGs evaluated than when they are 
se evaluated. Other data obtained in 
e the oa suggest that this may 
ated the tsi Favorably evaluated subjects 
members ommunication skills of their fellow 
favorabl significantly higher than did un- 
ido nated subjects, but the Negro 
Pared Sith was considered less skillful com- 
.. Members oe average group member by 
Wsuccessful successful than by members of 
udy (Kat groups, Findings from another 
“that e & Cohen, 1962) also suggest 
a skillful perspns may be less attracted to 
“llectivenes egro regardless of their own 
Sna Ae composed of one Negro 
Operative] ite subject solved problems co- 
training» y. In one condition (“assertion 
formati. ) the experimenter varied the in- 
y e available to each partner so that 
te AE confidently propose the cor- 
~ With lon, but each partner was favored 
Problems, better information on half the 
training In the other condition (“no 
hormati, neither partner received sufficient 
trect oe to permit him to propose the 
"Xperience ution confidently. Following this 
tion = e the Negro subjects in both condi- 
for their Tessed the same degree of preference 
but f Partners (for a future experiment), 
tio rai white subjects who received “asser- 
less = y preferred their Negro partners 
“ondition did those in the “no training” 
ui 
ime pr 


j 8 al findings with respect to the 
tion 
i 


Oposition that interpersonal attrac- 
p tccessfur greater in successful than in 
leing groups have been reported by 
one me (1960). In group situations where 

T Proyj mber was clearly responsible for im- 
tate] 8 group performance (or, more accu- 
ther r decreasing the “likelihood of failure”) 
"atings aS no significant increase in subjects’ 

Of he Of each other, but only in their ratings 
Pro em One member responsible for the im- 
Hoy w ent. On the other hand, in the condi- 
i ere there was only a small reduction in 
tog 00d of failure, group members tended 


Preciate each other, apparently “holding 
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each other responsible for the relatively small 
improvement.” 

It seems clear that members of successful 
groups tend to like one another more than 
do members of unsuccessful ones. It is also 
true, however, that under certain conditions 
intermember liking follows shared failure 
especially, we suggest, where the failure is 
perceived as arbitrarily imposed by an 
external source. 7 

Attraction to Source of Reward. As in the 
case of shared failure, shared success, too 
may be seen as emanating from an external 
source in a more or less arbitrary fashion 
or it may be perceived as due to the skill or 
other characteristics of one’s fellow group 
members. Such a distinction could not be 
made unambiguously in the case of the in- 
vestigations discussed in the preceding sec- 
tion. Here we will consider research which 
has measured the attitudes of reward recipi- 
ents toward those who were more or less 
directly responsible for the satisfying state 
of affairs. 

The study cited above by Kleiner (1960) 
bears directly on the proposition here con- 
sidered. Groups of two subjects and one 
“stooge” worked on puzzles, and their ability 
to solve problems was compared with that 
of business executives. In each group the 
stooge was clearly instrumental in improving 
group performance. When the improvement 
produced a large reduction in threat of 
failure, consequent liking and desire to know 
the confederate were significantly greater 
than when the threat reduction was small. 
Attraction to the helpful stooge also tended 
to increase more in groups under initially 
high threat of failure than in groups under 
low threat. A somewhat similar design was 
used to test a related hypothesis by Ziller 
and Behringer (1960). Groups solved prob- 
lems.and were informed of their success or 
failure. Into each group, then, a new member 
was introduced and a more difficult problem 
(which he could solve) was presented. The 
newcomer was rated higher by subjects in 
previously failing groups than in previously 
successful ones; it was in the former groups, 
of course, that his presence was associated 
with the greater improvement in group 
performance. 
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Earlier, Jennings (1950) reported vo 
girls highly chosen by others to live or wor 
with were sources of need gratification, and 
she suggested that choice on such criteri 
“may, in a sense, be considered earned by 
the person chosen [p. 165].” Horwitz (1958) 
found that in an experimental simulation of 
a classroom situation the teacher was signifi- 
cantly more liked and was evaluated more 
positively on personal characteristics when he 
behaved in accord with the students’ desires 
to repeat parts of a lesson than when he be- 
haved in accord with his own desire, contrary 
to that of the students. In a different kind of 
situation (Solomon, 1960), subjects who com- 
municated with unseen partners via electric 
switches developed greater liking for those 
partners whose game strategy provided them 
with maximum gain. Also reported have been 
data supporting a relationship between the 
feeling that one has influenced others (which 
we can assume to be gratifying or rewarding) 
and the liking of these others (Zander & 
Cohen, 1955; Tagiuri & Kogan, 1960). 

Two studies have indicated that behavior 
by one person which is generally considered 
rewarding to another will not be so under all 
conditions. Howard and Berkowitz (1958) 
have reported that when judges disagree in 
their evaluations of a person’s performance, 
the deviant judge tends to be seen as prob- 
ably in error and tends to be rejected on 
a sociometric question regardless of whether 
his evaluation was positive or negative. Re- 
lated findings were reported by Deutsch and 
Solomon (1959), Although subjects in their 
experiment tended, generally, to respond 
more favorably to persons who evaluated 
their task performance positively, they were 
less favorable if they had been Jed to believe 
by another source that their performance was 
poor, and under this condition they re- 
sponded favorably to a negative evaluator. 
It appears as if a “correct” judgment may 
constitute desired behavior in situations 
where individuals received feedback about 
their actions from more than one source and 
adopt either one or the majority evaluation 
as their standard. 

Individual Reward in the Presence of 
Others. A different interpretation of the role 
of success in influencing liking between indi- 
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viduals is that A’s attraction to B he 
depend upon B’s having also a aaa 
within the same group nor upon re snsteath 
mentality in obtaining reward for # i tisfying 
B’s presence when A experiences @ er i 
state of affairs is hypothesized as bel E action 
ficient condition for enhancing A’S oat 
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fortunately, always been — — 
investigations we will consider be 
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miner of his attraction to the antin y was 
Group goal attainment in this 5$ pich WË 
unrelated to individual attainment T 
primarily a function of indiv! a ed V 
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Spector (1956), Among men opera" + adit 4 
laboratory as military intelligent judg? 
teams, those “promoted” to sergea mor ak 
their team as being significantly yithi i 
tractive, and desired to remain ted. oa 
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ther, among the promoted subjects ‘on 
believed the probability of promot”, 
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of their group than were those W hie 
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sociometric test given outside the game i 
tion, the successful children chose i N 
greater proportion of their fellow SOU) jdr 
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Whether the former believed that the s 
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is a question on which, unfortunt nÉ 
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the frequency of reward obtained in the 
Presence of others and by using an additional 
test of attraction to group members. On 
each of the tests significantly more of the 
Subjects who received six rewards chose fel- 
low members than did those who received 
hree or no rewards; no reliable differences 
Were found between the latter two conditions. 
In both this and the previous study the num- 
er of rewarded subjects within each group 
varied, and later examination of the data has 
revealed some tendency for rewarded sub- 
Kects to prefer other rewarded subjects. 
ne Ndications that certain personality factors 
a affect the influence which one’s own 
ee or failure has on our affective re- 
ee to others may be found in data 
Ea by Stotland (1959). Members of 
ratel Sroups first worked together, then sepa- 
Other, In the latter condition some failed and 
5 Succeeded on either a group relevant 
Was geant task, Attraction to the group 
You iL measured by two questions: Would 
Much a to meet with the group again? How 
on p you like the other members? Success 
reliably individual tasks was associated with 
Was fai More attraction to the group than 
jects allure, but among those who failed, sub- 
Ore with high self-esteem found their group 
esteem active than subjects with low self- 
Etween h ere was no reliable flitterence 
Succeedeg and low-esteem subjects who 
Ro. ae findings have been reported by 
differe aum (1959) who found no reliable 
Scale ii in responses to a social distance 
Jeets carding their partners) between sub- 
iscy © failed and those who succeeded on 
Rive, Slon task. Instead subjects who were 
No feedback about their performance 
eir partners significantly more than 
dng given either favorable or unfavor- 
aluations, It may be important that 
derg © Subjects were told that both part- 
Would fail or succeed together, while 
er half were told that if one partner 
at €d the other would fail. No measure 
Co Mteraction between this cooperation- 
Riven, tion variable and success-failure is 
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tend to be chosen for further activity by both 
other successful and unsuccessful persons. 
While this kind of choice does not necessarily 
indicate “liking,” Gilchrist also reported a 
generally greater choice of successful sub- 
jects for social interaction (to speak to at the 
next meeting, have coffee with, double date 
with). Berkowitz (1957) reported, as an 
incidental finding in a study designed to test 
another hypothesis, that subjects liked their 
partners (simulated) better when the latter 
were supposedly proficient on a task than 
when they were supposedly poor, In another 
investigation (Berkowitz et al., 1957) where 
the performance of group members was 
evaluated by “experts,” highly task moti- 
vated subjects generally indicated greater at- 
traction to favorably evaluated than to un- 
favorably evaluated group members on both 
work partner and social companion questions. 
In still another relevant study by Stotland 
and Hillmer (1962), similar data were ob- 
tained: Subjects tended to show a greater 
desire to converse with a hypothetical person 
they were told was good at a task than with 
one they were told was poor. 

What the effect one’s own success or failure 
may have on the tendency to find a successful 
other attractive is an important question on 
which empirical attention needs to be di- 
rectly focused. Existing data have not clari- 
fied the nature of the interaction between the 
variables of own goal attainment and others’ 
in affecting interpersonal attraction. It is 
probable also that the interaction will vary 
from one set of circumstances to another. We 
have already seen that in certain cases un- 
successful persons show a preference for 
others who are “in the same boat,” but the 
general nature of conditions under which this 
occurs is far from clear. We have also seen 
that when a white person and a Negro are 
equally competent on a task, the former tends 
to find the latter less attractive (because it 
violates the white person’s expectations or 
reduces the value of his own attainment?). 
It is possible that under still other conditions 
the variables of own and other’s success may 
operate independently so that an individual 
will be more positively disposed toward the 
members of his group or toward persons in 
his immediate environment following his re- 
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ceipt of reward, but his attraction toward 
other rewarded persons will be even greater 
than that toward nonrewarded ones. Future 
research might profitably be directed toward 
clarifying such questions and focusing on the 
interaction of variables. 


Reduction of Cognitive Dissonance 


It has been suggested that one way of 
reducing dissonance relative to cognition of 
another person is by increasing one’s attrac- 
tion toward him. In such a case it must be 
assumed that alternative methods of reducing 
dissonance are either unavailable or less 
likely to occur for one reason or another. 

Supporting data have come from an inves- 
tigation by Aronson and Mills (1959). They 
predicted that persons who undergo an un- 
pleasant initiation in order to gain admission 
to a group and then find that the group has 
not been worth such pains (a dissonant 
cognition) will consequently exaggerate the 
group’s positive characteristics. An alterna- 
tive way to reduce the dissonance by con- 
vincing oneself that the initiation was not very 
unpleasant was made unlikely by the use of 
a severe initiation condition involving the 
reading aloud of obscene words, University 
girls undergoing this procedure were expected 
to find the group into which they were ac- 
cepted more attractive than girls who were 
accepted without undergoing an initiation or 
whose initiation was “mild.” After the sub- 
jects heard a “very dull” taped discussion, 
presumably being carried on by the group, 
they rated both the discussion and the 
participants, Responses to both measures 
supported the hypotheses, 


It may be important that in this experi- 
ment the subjects were accepted into the 
group. It is possible that acceptance after 
a severe initiation is more valuable, more 
rewarding, than acceptance after a mild or 
no initiation. A con 


dition of no-acceptance 
following the initiation Procedures, with the 


same dull discussion and its Participants 
subsequently evaluated, should provide worth- 
while data. 

Schopler and Bateson 
other alternative explanation of the above 
findings, replicated the 


Aronson and Mills 
study under somewhat altered conditions of 


(1962), to test an- 
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ees two 
subject recruitment and the aiiora that 
new experimental groups. They fo ondition 
subjects in the severe initiation inet 
rated the content of the dull group, i acta 
significantly higher than did ane oui 
mild initiation condition (the contro £ jects 
thus confirming Aronson and Mills. ower 
who received a severe initiation, disparost 
and also heard the experimenter oe atings 
the discussion, did not differ in their B 
of the discussion from the control oe i 
and made reliably lower ratings Working 
subjects in the severe condition. 59) theo- 
within the Tribaut and Kelley (19 explain 
retical system, Schopler and sig severe 
their results by arguing that in the expeti- 
condition, high dependence upon the greatel 
menter is produced and, thus, & ower! 
necessity for subjects to increase their P ji 

W: 

One of the likeliest ways of increasing powe' Sinion 
was available ... was to conform to kae t 
of the E [experimenter], who strongly IMFs chople" 
the Ss [subjects] would like the discussion 


& Bateson, 1962, p. 637]. 


by 
niue K ced 

In the disparage condition, introd ee 
Schopler and Bateson, where the exP f i 


ssio” 
predicted to the subjects that the oe he 
would probably not be good or excit other 
subjects rated the discussion low- d 
condition introduced by Schopler an re coo” 
son yielded ambiguous results, Of mO ect t° 
cern to us are the findings with resp“ J- 
ratings of the discussion participan the 
though severe condition subjects me con 
Participants higher than did either whe one 
trol or disparage condition subjects: 
of the differences were reliable. 


ate- 


L 

TICA 

ANTECEDENTS OF Lixtng: THEORET 
EXPECTATIONS 


ze 

: an! 

The foregoing review of research, 018 pip? 
going (0 ns 


in terms of empirically tested relatio” nse 
between a variety of conditions and © we 
quent interpersonal attraction, shou p i 
believe, have demonstrated at least 07° 

If we define the cohesiveness of small £ pidh 
in terms of the positive judgments | e 
members make of one another, then “ pde 
deal is known about the conditions p: wi 
Which cohesiveness is likely to devel 
Suggest that the conclusion of Cat 


| 
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and Zander (1953, p. 80; 1960, p. 79) that 
out T “very little systematic knowledge 
o e conditions which heighten cohesive- 
Presen $ not an accurate reflection of the 
7h ee of affairs, although perhaps we 
en a Temove the qualifying adjective 
hawy’s ‘on We must, similarly, question 
aS statement that little sound 
o iren been carried out in an attempt 
share oe group development. We do 
is" ‘and Shaw, the conclusion, that there 
Ure re] €ncy to ignore much of the litera- 
i evant to the problem under considera- 
Search a m both theoretical works and re- 
doubt he This state of affairs is no 
ing Lit ated to the vastness of the exist- 

erature as well as to its generally 

€oretical nature, 
o ee now to a consideration of theory, 
Words H Which have been made, in other 
basis 4 © Predict specific relationships on the 

a set of more general principles. 


aay 
2 n 
Ce or Congruity 


Hei 
en Most clearly identified with the 
Questigh sition as explicitly related to the 
i of attraction between persons is 


cide 
that ~ (1946, 1958). His most basic concept, 


sitų oñ the “balanced state,” refers to a 
Detson. „ 2 Which the perceived units (i.e., 
and got à subjective relationship, or person 


Senti ae object or idea) and the experienced 


We wi g 
tiong Will consider here only those predic- 


Tepenge Which sentiment formation is the 
i Simi] t variable: A person tends to like 
ktit ay Individual (but not when the simi- 
tiong Carries with it disagreeable implica- 


vidu l 3S, for example, when the other indi- 
h Perso the same disability as the person) ; 
. n tends to like another with whom 
Action contact (but not if continued inter- 
rie, pls the perception of dissimilarity 
Mili he Other); A person tends to like a 
“tian Other (but not if the other is dis- 
t). Tn each of the above hypotheses it 
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is B’s similarity to A which underlies A’s 
attraction to him. Further, if another indi- 
vidual’s behavior is perceived as congruent 
with a person’s values, the person will tend 
to like the other. Whether similarity is im- 
plied by “balance” or whether it has some 
other value is not clear. We may note 
Heider’s statement that “. . . it is satisfy- 
ing to find support for one’s own view [1958, 
p. 196],” but we are not sure whether sup- 
port for one’s own view is seen as “satisfy- 
ing” a person’s need for balance; that is, 
whether because A relates to B they should 
be similar to one another to achieve a bal- 
anced state or whether support for one’s view 
“satisfies” a different motive. Since balance 
is said to be achieved “if the relations be- 
tween . . . entities are all positive [1958, p. 
202],” it may be that Heider considers 
similarity to be intrinsically positive. 

There are two other relevant hypotheses 
which are, at least superficially, unrelated to 
similarity: A person tends to like another 
whom he has benefited and also one who has 
benefited him. To be benefited by a person 
one dislikes or to benefit someone one dis- 
likes would obviously be disharmonious or 
incongruous. 

All of the above predictions have been 
supported to varying degrees by empirical 
data, although surprisingly few investigations 
have been explicitly designed to test Heider’s 
system. More often, the hypotheses tested in 
relevant research have been unrelated to any 
systematic point of view or have stemmed 
from other positions. Heider’s influence is 
apparent, however, in other theoretical 
formulations. Newcomb (1953), for example, 
has similarly built a system around the basic 
principle of balance, postulated a “strain 
toward symmetry,” and stressed the role of 
similarity between persons as the major 
determinant of attraction. We interpret New- 
comb’s more recent position, however, as 
more relevant to a reward framework and 
will, therefore, return to it below. 

Festinger. The increasingly influential 
theory of cognitive dissonance has been ap- 
plied to the prediction of behavior in social 
contexts in a paper by Festinger and Aronson 
(1960). This theory, too, focuses on balance 
and rests upon the basic assumption that 
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individuals who experience simultaneous 
cognitions which are contradictory or incon- 
sistent with one another will tend to reduce 
this incongruity (dissonance) by changing 
one of the cognitions in the direction of 
consistency with the other. 

Regarding the problem with which we are 
here concerned, Festinger and Aronson have 
suggested that liking for an individual or 
group of persons may be a dissonance 
reducing mechanism. When, for example, a 
person has invested time or effort in order 
to interact with other persons and then finds 
that they are less pleasant than he had 
anticipated, he is faced 


with dissonant 
cognitions. 


There are at least two ways that a person could 
reduce dissonance in such a situation: (a) he could 
undervalue the amount of his investment . or 
(b) he could overvalue the group by emphasizing its 
positive aspects ... [p. 220] 


We have previously considered evidence 
relevant to this hypothesis. 


Reward 


It is possible to find reinforcement proposi- 
tions within the more recent theoretical state- 
ments of a great many social psychologists 
(e.g., Allport, 1962; Bass, 1961; Miller, 
1963). It is, for example, taken for granted 
that individuals are attracted to groups as 
a direct function of the satisfaction they are 
able to derive within them, Our particular 
interest here is in more explicit propositions 
regarding attraction to individuals, 

Lazarsfeld and Merton, Although these 
writers have stressed common values as the 
dominant factor in friendship formation, the 
variable of reward seems, nevertheless to be 
more primary. “For those with similar 
values,” Lazarsfeld and Merton (1954) sug- 
gest, “social contact, because it is eae. 
ing, will motivate them to seek further con- 
tact.” In some cases friendships will form 
before the individuals become aware that 
they differ on particular issues. If there have 
been sufficient prior rewards, that is, suf- 
ficient areas of similarity, then some dis- 
agreement can be tolerated, but the tendency 
will be to modify opinions and come closer 
together if the friendship is to endure, since 
“common values make social interaction a 
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rewarding experience.” They suggest that o 
most critical evidence with respect to on 
question might well be obtained from data 
broken friendships. á 
Newcomb. Resembling the above 1 de 
eral outline, Newcomb’s position is more A 
tailed and refined. He has chosen to dis F 
guish between attitudes toward objects <n 
attitudes toward persons, using the mb. 
“attraction” to refer to the latter (AR 
1956), and postulated that the reinforce cll 
which one person experiences as he inte dent 
with another is the major indepe? the 
variable determining attraction to con- 
other. Specifically, positive attraction 15 ate 
ceptualized in terms of reward-associate s of 
titudes, and negative attraction in ena 1 
punishment-associated attitudes Oerein 
1960). Thus, behind the relationship Pe 
propinquity and attraction, according en 
Newcomb (1956), is the fact that ratio 
persons interact the reward-punishment than 
is more often such as to be reinforcing ace 
extinguishing.” In the same vein pre!® like 
for persons resembling others whom W° gen 
is accounted for by the “principle sds 10" 
eralization,” that is to say, thresho s and 
interaction with such persons are oa ction 
“the likelihood of the rewards of inter tha" 
with such persons is greater [1956 
for others, tribi” 
Perhaps Newcomb’s most unique CO” pee” 
tion to systemization in this area Þa con” 
his emphasis on the role played bY gsbiP: 
munication in the maintenance of ien ai 
Communication is recognized as the jaual® 
interaction process through which indiv ate 
can reward one another, In addition, # 


gen- 


cur 
eo 
communication is, in itself, rewardine, t0 


consequently “reward qdlue will attat ihat 
the co-communicator—which is t0 5® peas? 
positive attraction toward him will 
(Othier things equal) with frequen? Ei 
accurate communication with him [19 "au? 
Similarity between persons, again i 


it is rewarding, is said to account cor 
ttractgj. 

r more of the variance in interpersonal * p, a 
an does any other single variable [195 y 


: „aip g 
Newcomb has refined this relations jvt 
noting that similarity, to be an 


me 
determiner of interpersonal attractio” 
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refer to attitudes toward valued objects, that 


's, those which are “important and relevant 
[1958] > 


An indivi 
individual tends to be attracted toward others 


who Paa E 

thin on Seen as viewing as important the same 
i ihe that he himself regards as important, and as 
hi & attitudes toward them that are similar to 


own. Upo 
“Scanned” pon first 


jud 
a ver i a 

at Betas Tange of objects. At first, therefore, 
i n should be more closely related to per- 
agreement. Given ample 
jud; or association and communication, 
D. lp tend to become more accurate [1958, 


Nee predictions have been verified in 
tom re (1961) later research. 
relates e above it is clear how Newcomb 
tiran nity to similarity and then to 
Ptoximitn’ It is only to the degree that 
Perceived. i facilitates the development of 
i tnpa arity of attitude [1956]” that 
i also Ibutes to attraction. Communication 
tends related to similarity since the latter 
legar oe from the former, at least with 
both te the object of communication, and 
a deas communication and similarity 
Ih like Pendent sources of reward (1958). 
ttracteg neh the fact that individuals are 
feei ee to persons who reciprocate their 
Similari Seen as “a special case of perceived 
to pouty,” assuming that self-attitudes tend 
avorable, 
the “womb (1960) is also concerned with 
Cane, “8orles of reward which are signifi- 
ting; Mterpersonal relations and has dis- 
among admiration (A attributes 
berga Atacteristics to B), reciprocation 
su ort elves B as liking him), and perceived 
to his (A attributes to B attitudes similar 
Wouj a: A’s attraction to group members 
terg; hie ca „depend upon two major parame- 
(his 3 3 ierarchy of ‘appetites’ for reward” 
Otives), and “the reward possibilities 
er members have to offer.” 
vide considerable body of data already pro- 
Newa Pport for the major predictions within 
Ptaa S theory. Most valuable for eee, 
ee meee we believe, be exploration a 
Oneq vet in which the parameters men 
h det, © the preceding paragraph function 
‘mining attraction. 
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Homans. In his recent book, Homans 
(1961) has strongly embraced modern learn- 
ing theory as a source of principles regarding 
social behavior. As compared with Newcomb, 
therefore, Homans’ theoretical propositions 
are more explicitly phrased in behavioral 
terms, but both systems depend upon the 
basic concepts of reinforcement and pun- 
ishment. 

Homans is concerned with both activities 
(overt behavior) and sentiments (overt 
behaviors which provide visible signs of 
internal attitudes and feelings), but since the 
former concept includes the latter, no sepa- 
rate sets of propositions are required. A pri- 
mary assumption is that the activities and 
sentiments which one individual emits in 
response to another are reinforcing or punish- 
ing to some degree or “more or less valuable” 
to the other (p. 34). Special stress is placed 
on social approval (a sentiment) as a gen- 
eralized reinforcer. The value of a reward 
to a person will depend upon his relatively 
constant past experience (his genetic and 
cultural character) and his present state of 
deprivation. “Costs” are subtracted from re- 
ward to measure the individual’s “profit,” or 
net reward. 

In Homans’ terms, 


cohesiveness refers to the values of the different 
kinds of rewards available to members of the group: 
the more valuable to a group’s members are the ac- 
tivities (or sentiments) they receive from other 
members or from the environment, the more cohesive 


it is [p. 88]. 


With regard to interpersonal attraction, the 
following major hypothesis is derived from 
the general assumptions: The more valuable 
a person’s activities are to others, the greater 
is the esteem in which he is held (p. 164). 
From this, in turn, follow certain specific 
expectations, for example: Persons will pro- 
vide more value to one another if they share 
the same orientations, or have similar back- 
grounds (p. 214 f.); Because some individuals 
provide the most value to others, mutual 
choice will be greatest among high esteem, 
or overchosen, persons (p. 164 f.); Individuals 
tend to prefer high status individuals be- 
cause such individuals have most likely ac- 
quired their status by rewarding others, and 
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hence they are associated with the expectation 
of reward (p. 314). 

Homans has suggested, further, that men 
tend to value “fair exchange” or “distributive 
justice.” This “is realized when the profit 

. of each man is directly proportional to 
his investments: such things as age, sex, 
seniority, or acquired skill [p. 264].” From 
this follows the specific expectation that a 
condition of status congruence among per- 
sons will be conducive to mutual esteem or 
congeniality. 

The relationship between interaction and 
favorable sentiments is also explained on the 


basis of the value derived from the inter- 
action: 


the chances are that cach one [of an interacting 
pair, for example] will find some of the other’s 
activities valuable, if only because they may be 
obtained at less cost from him than from a third 
party at a greater distance: the distance of a 
source of reward adds something to the cost of 
getting it [p. 183 f.], 

Homans has considered the cı 
findings which have been obtained with re- 
spect to the liking of leaders, He Points out 
that a man “earns authorit 


i y by acquiring 
esteem, and he acquires esteem by rewarding 


others [p. 314],” but since a man holding 
authority is likely also to incur costs on 
others by use of punishment or simply by 
depriving others of the chance to be the 
leader, the sentiments which develop toward 


him are likely to þe ambivalent (p. 299 f ) 
Homan offer ar 


ontradictory 


similar 


© (1961) book 
ppeared subse- 
Kelly, we are 
stems in the 
m to fall best 


ao continuum of 
similarity. 


Like Homans, Thibaut and Kelley analyzed 
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social interaction in terms of rewards on 
factions or reduced drives) and costs Na 
cal or mental anxiety which serves to E 
performance). In addition, they intro das 
the concept of control, which is describen 
mediated by one’s ability to affect the 
comes of another’s behavior. Thus, 


viable... or de 
the participants have for contributing to 
ing from each other's outcomes [p. 41- 


j K Kelley 
A unique feature of the Thibaut =A two 


approach is its distinction between to 
evaluative standards which persons a 
the outcomes, that is, to the rewa" terms 
costs, of interaction, One standard is 0 sed 
of what the person feels he desen m tive 
on his past experiences and presen io othe! 
tion (the comparison level), while t nd Ë 
is in terms of available alternatives 
the lowest level of outcomes which ; ki 
son will accept (the comparison ag 
alternatives), A person's dependence De 
group (or another individual) is 54! ji 
a function of the latter, but his ate 
the group or other individual depen? y 
the former. The two evaluative prar i 
however, are not independent. “Sio n i 
comes from the best alternative rian 
are included among those that deter™” Chat 
CL [comparison level], the CL am tend 
[comparison level for alternatives] K 
to be positively correlated [p- 1021. n 
then, will be dependence and attractio reh 
It is with attraction that we are cont posts 
and Thibaut and Kelley (p. 24) Piep” 
that a “person’s attraction to a dyad in el 
upon his evaluation of his outcomes ions. 
tion to his CL...” Specific rean tepi 
between antecedent variables an ro H 
sonal attraction are explained in a H 
both “ability to reward each othe! a it 
Cost of providing this reward.” het E 
arity is important to friendship Þe cfa iA 


1 
ee a 
many value areas an individual ÍS Wig 
of support”; agreement is therefore ied 
ing, 


“In other words, provision © att 
Support may be considered as havin8 
reinforcement value [p. 42].” Wher? ds ° 
mentarity contributes to the rew 
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tained from interaction, then it will be posi- 
tively related to attraction (p. 47). Similarly, 
We tend to prefer high status people because 
they “have greater extrinsic means . . . for 
ewarding others,” or association with such 
Persons may simply provide the reward of 
Prestige [p. 48]. 
ion Reinforcement Formulations. The 
Ea S of Homans, of Thibaut and Kelley, 
oa : a somewhat lesser extent, of New- 
social ne a ere with the entire sphere of 
a cae ‘avior. We have selected only what 
A ved to be most relevant to the present 
em, Other formulations have been more 


Nar A 
ae Towly concerned only with attraction 
Ong persons, 


ag bitone (1958), for example, maintains 
i the Simple common assumption that at- 
taction t 


amount o an individual “depends upon the 
ion? a of reinforcement’ or ‘need satisfac- 
too tained from that person . . .” is far 
lene to provide an understanding of 
What nants. “What we want to know is 
Sion particular factors reinforce, reduce ten- 
nt “sl - .” He has suggested that impor- 
(for ‘mensions are those of responsibility 
Or newt act), intentionality (i.e. good, bad, 
€), and Justifiability (on some logical, 
Ditone ethical standard). According to 
tee of these has a predictable 
eg “ty „P With attraction: A’s attractive- 
"esponsibi Will vary with the degree of B’s 
With ps ility for positive or negative acts, 
justifaps Jntentions, and with the degree of 
The ility of the positive or negative acts. 
Vidua] 2Ctots vary the value to one indi- 
They oi the specific behavior of another. 
it ig Weight the value (valence) of acts, and 
the a © value of his acts which determines 
relate tractiveness of the actor. Status is 
Nediati to attraction, for example, via the 
s ton of the above constructs, since high 
Sible ¿persons will be seen as more respon- 
ving oe an improved state of affairs, as 
i thei 800d intentions, and as being justified 
ehavior, 
“arli mumber of empirical studies, referred to 
nq hi in this paper, designed by Pepitone 
have 'S co-workers to test his predictions, 
'elded generally positive results. 

Rro Potheses regarding attraction among 
Members have been derived from the 
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general propositions of Hullian behavior 
theory by Lott (1961). Attraction is re- 
phrased in terms of positive attitudes and 
defined, following Doob (1947), as implicit, 
anticipatory goal responses. On the basis of 
a sequence of learning theory principles, it 
is expected that stimuli associated with re- 
inforcement can eventually evoke the implicit 
and anticipatory component of the response 
to reinforcement, the goal response. Hence, 
the “primary condition for the development 
of mutual positive attitudes among group 
members” is said to be “the attainment of 
goals or the receipt of rewards in one an- 
other’s presence.” Secondary hypotheses re- 
garding antecedents take into account fre- 
quency of reward and delay of reward. It is 
also predicted that the more closely or the 
more frequently a particular group member 
is related to the achievement of goals, the 
greater will be the strength of the positive 
attitudes developed toward him. This latter 
hypothesis may be interpreted as relevant to 
status and leadership. Studies designed to 
test this formulation have been cited earlier; 
other investigations are in progress. 

Overview. There is clear agreement among 
many contemporary theorists that attraction 
will follow if one individual either directly 
provides another with reward or need satis- 
faction, is perceived as potentially able to do 
so, or is otherwise associated with such a 
state of affairs. Furthermore, the specific 
antecedent variables which empirical research 
has shown to be related to interpersonal 
attraction can, for the most part, be inter- 
preted in support of this general proposition. 
The data on propinquity, group climate 
(including cooperation, democratic leader- 
ship, etc.), acceptance, status, personality 
characteristics, similarity (or complemen- 
tarity), and success are all predictable from 
a general reinforcement position. It is more 
difficult to deal with the research findings on 
mutual threat or failure. If it can be shown 
clearly, however, as some investigators have 
suggested, that mutual threat increases inter- 
personal attraction only under conditions 
where the possibility exists of a cooperative 
solution to the common problem, then the 
latter condition could be identified as the 
source of reward. . 
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CONSEQUENCES OF LIKING: EMPIRICAL 
RELATIONSHIPS 


The investigations which are considered in 
this section may be roughly divided into the 
following types: 


1. studies in which natural, existing groups 
are first measured for the degree of inter- 
personal attraction among their members, 
and scores on this measure are then related 
to some other variable; 

2. studies in which groups assumed to 
differ in level of liking among members (as 
for example, groups of friends and groups of 
strangers) or groups known to differ on some 
variable which is assumed to reflect inter- 
personal attraction are compared on some 
other variable or dependent measure; 

3. experiments in which interpersonal 
liking is explicitly manipulated through in- 
structions or some other Operation, and its 
consequences for some dependent variable 
are subsequently ascertained ; 

4. experiments in which interpersonal 
liking is indirectly varied through the direct 
manipulation of some related condition, and 


measures are then taken on a dependent 
variable. 


Expression of Aggression 


In an early study, French (1941) observed 
groups of strangers and groups of team or 
club members in a frustrating situation in 
which the group attempted to solve insoluble 
problems, A variety of measures indicated 
greater expression of interpersonal aggression 
within the “organized” than within the 
Stranger groups, without major disruption of 
group functioning. Similarly, Wright (1943) 
found that pairs of children who 


friends expressed more aggression toward the 


low-cohesive groups. 
members of the 
e “an exceptional 
l matched, com- 


ng well; opposite 
expectations were presented to members of 


low-cohesive groups. Following provocation 
by a rude and insulting assistant, high- 
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cohesive pairs were found to express greater 
hostility than low-cohesive pairs. , 1958) 
It has been suggested by Berkowitz ( ae 
that liking among group members ea “ 
effect of lowering inhibitions with — s 
aggressive behavior. He has shown, ae 
in two investigations (Berkowitz & a 
1962; Berkowitz & Holmes, 1960) that ere 
conditions where aggression is displaced oo 
a person other than the instigator, er 
tion of aggression will vary positive A na 
the degree of dislike of the person. a 
first mentioned study, subjects who ha pi 
frustrated and insulted by the erpen ait 
administered more shocks to disliked 


l 

«active chec 

ated disliked partners on an adject mate 
list more unfavorably than they ev erson 


rce 
tside 


ill ten! 
others whom he likes, but that he ae 


not to displace aggression onto liked Pe vail- 


ed persons are as readily 
able? 


Self-Evaluation 


ime 
Manis (1955) has shown that, a ber 
there is a greater increase in agreeme male 
tween the self-concepts of a sample sai of 
freshmen and their friends’ percept?" ir 
them than between self-concepts 2” u 
nonfriends’ perceptions of them. Two $ ide- 
(Maehr, Mensing, & Nafzger, 1962; e o 
beck, 1960) varied the evaluation issu 
subjects’ Perfo ne K 


ing a tendency to be positively attrac ons 
an expert, th 
may 


1S e 
end to change sock cor” 
rection of atie pim » 
l h the evaluation made © 
a liked other. econ 

Negative evidence, however, has F ject” 
been reported by Pilisuk (1962)- of the, 
tended to retain favorable estimates rse 
own performance in the face of adve" 
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cism, sy 
fleni ise aes the criticism came from a 
Was alle ie a stranger. When the criticism 
Wever to have come from a friend, 
tange of subjects made use of a wide 
friends? aoe to explain their 
Shapiro (49. havior. Harvey, Kelley, and 
Subjects to $ 7) and Harvey (1962) exposed 
information sens of fictitiously negative 
Similarly ‘hee. ONG themselves and found, 
rom a fiend whether this information came 
difference in or stranger made no significant 
in the ars Subsequent self-ratings. Subjects 
empted to S investigation, however, at- 
ions Ste friends’ derogatory evalu- 
reevaluate Pi from strangers’ and tended 
Y reeval tiends more negatively than 
then: oF uated strangers who had rated 
(Harvey eee In the earlier study 
It was fou al., 1957), on the other hand, 
CRative nd that strangers who had made 
Wently go latione tended to be subse- 
Made oe whereas distortions were 
t? these Po Negative ratings themselves 
doe. clear, th esumably came from friends. 
ie ot ra that criticism by a friend 
plua ion, Wi fably lead to changes in self- 
vae cater A sled negative evaluation leads 
digg tion of et at rationalization or de- 
Cite ing Sent source or the use of different 
is cism is a ly when the source of the 
Not ear tiend in contrast to a nonfriend 
by lf evaluat: 
A Co i was found to be influenced 
vay OY Kipni uations of their friends in a 
val tions a (1961). She compared self- 
syp tions of male freshmen with their 
ag ects tended others and found that the 
haq” Week A to describe themselves, after 
Sup: Previous e in the same way that they 
trai S who h described their best friends. 
then their Te ascribed relatively “good” 
elves ce est friends as compared with 
Ea ae their self-evaluations in a 
Pisgah i, and those who had 
oa hese to their best friends 
pinot se f-descriptions in a negative 
t 
tot c “al small group of investigations has 
oy Maney ed with evaluation of own per- 
dye bility a a function of the performance 
Mussen liked others. Data reported by 
and Zander (1954) from a field 
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study of secondary school teachers indicated 
that those who evaluated their own teaching 
performance in a manner which deviated 
from their perception of the standards of 
their subgroup made higher failure scores the 
more they were attracted to the grou 

Failure was defined in terms of the daan 
ancy between own performance and ideal 
performance. Festinger, Torrey, and Willer- 
man (1954) varied the scores made by group 
members on an experimental task and ob- 
tained support for their hypothesis that the 


stronger the attraction of members to a group, the 
stronger will be the feelings of inadequacy on the 
part of those scoring less well than others and the 
stronger . . . [the] feeling of adequacy on the part 
of those scoring as well or better... [p. 173] 


An experiment by Stotland and Hillmer 
(1962) has shown that identification is not 
necessarily related to attraction. Subjects 
who read that another person had been good 
at a clerical task evaluated their own per- 
formances on the same task more favorably 
than did subjects who read that the other 
person had done poorly. This effect was re- 
ported to be independent of the level of 
attraction to the model, although there was 
an overall preference for the “good”? over 
the “poor” model on a “desire to converse 
with” question. 

In general, the research on relationships 
between liking and self-evaluation has 
yielded equivocal results pointing to the need 
for further clarifying investigation. 


Evaluation of the Situation 

Marquis, Guetzkow, and Heyns (1951) 

have reported, from a field study of decision 
ignificant positive cor- 


making conferences, Sl 
relations between a cohesiveness index and 
members’ satisfaction with the group process 


and with the meeting. The cohesiveness index 
included observer ratings of liking among 
group members. Jordan (1953), in an investi- 
gation designed to test Heider’s balance 
hypothesis, found that where two persons 
were described to subjects as liking one 
another, the situation which the hypothetical 
persons were in was rated as more pleasant 
than when the two persons were described 
as not liking each other. The relationship be- 
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tween persons had the most weight in influ- 
encing judgments of the situation, more 
weight than balance or imbalance in terms 
of agreement or disagreement between the 
two persons with regard to some object. 

In a study by Exline (1957), subjects 
who had been told that they were in groups 
with well-matched, congenial others expressed 
greater satisfaction with their group’s prog- 
ress than subjects given the opposite orienta- 
tion toward their fellow group members. 
Similarly, Libo (1953) had shown earlier 
that subjects told they were to work on a 
volunteered-for task with congenial group 
members differed significantly from subjects 
given the opposite instructions in their desire 
to stay with the group. Libo (1957) has 
also reported a significant relationship be- 
tween the attraction of patients to their 
therapists, following their first interaction, 
and whether or not the patients returned 
for a second visit one week later. 

That persons tend to respond favor, 
situations in which liked others ar 


ent appears to be an empirically 
generalization, 


‘ably to 
e pres- 
sound 


Evaluation of Others 


Perception of Similarity, A large number 
of investigations have provided evidence that 
persons will tend to perceive liked others as 
similar to themselves, The well known early 
study by Fiedler et al, (19 


i 52) confirmed the 
hypothesis that subjects will perceive fellow 


like best as more 


m teams (Fiedler, 1954), college freshmen 
ipnis, 1961), and sorority girls (Back 
& Secord, 1962). Lu : =y 


well, and Shoemake 


lity descriptions of Posi- 


end to be more 
acceptable than to their 
unacceptable self-descriptions, while the re- 


verse is the case for descri 


ptions of negative 
sociometric choices. In a 


quite different re- 
search context, the investigation of marital 
conflict, happily married persons have been 
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found to differ from unhappily icant aot 
in attributing greater trait similarity 
spouses (Preston et al., 1952). 
The same tendency to perceive 
persons as being like oneself has 
ported with respect to values and dents at 
Precker (1953) found that — 4 
tributed to their chosen advisers a their 
evaluative criteria which were similar childre®? 
own; and Davitz (1955) found aon their 
at a summer camp tended to pae camp 
most preferred peers as liking the sa! Wong 
activities that they did. Byrne am highly 
(1962) obtained data indicating tha 
prejudiced white subjects assume wed 
attitude dissimilarity between themse se 
an unknown Negro than between paii whi 
and an unknown white person WM 
persons low in prejudice assumed 


nni . . ro as 
degree of similarity with a a oa show": 


hat Pe 


preferred 
been Te 
attitudes: 


a white stranger. Tt has also bee 
among groups of Navy personnel, js wh! 
sons tend to assume that individua gan © 
they like also like each other (08 
Tagiuri, 1958), - relevant 
A number of other studies appear varied: 
here, although liking is not explicit ie 
Bieri (1953), for example, had 50 gree "i 
of subjects discuss together and se! and 
Conclusions about common experien ate 
preferred activities; control subjects A o BEE 
these topics individually, back-to-ba ae 
dictions about partners’ responses , wer? 
Rosenzweig Picture Frustration Tes U 
made before and after interaction. simila 
latter occasion, significantly greater subje” 
ity was found for the experimenta the 
between their own responses an 
sponses which they predicted for 
ners. In another investigation ee a 
1959), subject pairs who had worke 
cooperatively were reported to have 
quently assum 


ge 
subs 
a e? 
ed greater similarity n 
other on personality traits than subje tudi? 
had worked competitively. 
(Smith, 1958- Stotland, Zander, & ns “oy 
1961) have demonstrated that pers? etre 
are made aware of some similarity erce" 
themselves and another tend tO Ý 
additional similarity, jded ws 

Negative evidence has been pie tt 
Lundy (1959), Subjects discussed 4 


$ 


51 
Two © ula 
Nats gh? 


à 
d P 
ter 
d and 
ves 
hil hite 
the sa” 
w 
at T 
& 
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os first with one student, then with 
jects a a one of the interactions, sub- 
ad to ed to believe that their partners 
iting a ed them for the discussion, while 

e op na other, subjects were led to believe 
one’s E Assumption of similarity with 
scales) y ner (on the Allport-Vernon value 
of i found to be a function of order 
action) e (greater on the second inter- 
acceptance ut not to be associated with the 

Se €-rejection variable. 

i i Judgments of the Behavior of 
Mutter ee Horowitz, Lyons, and Perl- 
ace dis ol} reported that, in a face-to- 

embers Ussion group, agreement by group 
relateq Aa one another’s acts was directly 
X disliked Whether the “actors” were liked 
openers finding emerged only from 
Verbal ‘ee data, however, not from the 
Sion, n otd obtained during actual discus- 

Sh different kind of situation, a sum- 
Perform. P for boys, it was found that the 
leti Paa of high-status boys on an ath- 
rs. tended to be overestimated by their 
boys’ iile the performance of low-status 
l95 7 „underestimated (Sherif et al., 
nce w variations in judgments of perform- 
skill par not significantly related to actual 

Splayed on the task 
Wo : ask. 

Ptovige dies by Berkowitz (1956, 1957) 
Within ep ditional relevant data. High liking 
fitst iny €ctive air crews was found, in the 
Mem erg gation, to be accompanied by 

e anog erCEPtions of high motivation in 
Mtilizi ‘other. In the second investigation, 
Me Dat ated groups, high liking be- 
str Pie (manipulated by means of 
eniatit Ons with regard to probable con- 
af Mpaniet one’s fellow members) was 
Pinions ied by high regard for one another's 


er 

has pabtual Sensitivity, Suchman (1956) 
Pers ne ted, from a study of three five- 
howe Student groups, that “people who 
Morg . More favorability toward others were 
txi accurate in estimating their feelings.” 
ret (1957) similarly found that group 
Poh who were told that they were well 
em pe and congenial were more accurate 
iWin embers of noncongenial groups in per- 
loys the task-oriented behavior of their 

Members, Also somewhat relevant 15 & 
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study by Eisman and Levy (1960); they 
found that lip-reading was more accurate, 
the more the communicatee liked the com- 


municator. 

In a different kind of investigation Schutz 
(1961) composed groups so as to be homo- 
geneous (compatible) with respect to mem- 
bers’ interpersonal needs; after six meetings, 
group members were able to correctly iden- 
tify the interpersonal characteristics of their 
group when the common characteristic of the 
members was expressed behavior rather than 
behavior desired from others. 

From data obtained in studies not directly 
concerned with perceptual accuracy but with 
assumptions of similarity or judgments of 
others’ behavior (cited earlier), it is evident 
that subjects sometimes assume too much 
similarity to persons they like or over- 
estimate the goodness of their performance. 
Liking, therefore, may lead to inaccurate per- 
ception, although it may be associated with 
an increased sensitivity to the behavior and 
characteristics of liked others. Differential de- 
pendent measures of sensitivity and accuracy 
are needed. 

Perceived Reciprocal Attraction, From a 
variety of groups in different research set- 
tings (Tagiuri, 1958; Tagiuri, Blake & 
Bruner, 1953; Tagiuri, Bruner, & Blake, 
1958), Tagiuri and his co-workers have re- 

orted that the tendency toward “congru- 
» that is, the tendency to assume that 
e or negative feelings toward an- 
other are reciprocated, is the most powerful 
relationship obtained and is significantly 
greater than would be expected on the basis 
of chance (a robot model). Tagiuri (1958) 


ency, 
one’s positiv 


wrote: 


g to ask whether congruency 1s a tend- 
le to feel chosen by those whom they 
to choose those by whom 
dual 


Tt is intriguin 
ency for peop! fe 
choose, or contrariwise, by w 
they fed chosen. Undoubtedly, there are indivi 
differences in the order of this cycle [p. 322]. 


however, Tagiuri argued for the 
f the former tendency, since indi- 
viduals are usually first aware of their own 
feelings toward one another. Data relevan, 
to the perception of “reciprocated attraction’ 
have also been reported and discussed by 
Newcomb (1956, 1958, 1960). 


In general, 
dominance 0 
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Communication 


The hypothesis that persons communicate 
more with liked than with disliked persons 
has been supported by a number of investi- 
gations but the results of other studies have 
indicated that in situations where opinions 
differ, and where there is pressure toward 
uniformity, individuals tend to increase their 
communication to deviates. In some of these 
latter investigations measures have indicated 
that the deviates are disliked or rejected 
while in others this must be assumed. 

Kelley (1950) found that a larger propor- 
tion of students took part in a class discus- 
sion led by a substitute teacher when they 
had been informed that the teacher was 
“warm” than when they had been given the 
opposite information. Back (1951) compared 
high with low cohesive pairs (manipulated 
by instructions) with respect to communica- 
tion patterns in discussion of a story derived 
from pictures and found significantly more 
“withdrawal” patterns among the latter, 
Grossack (1954) has reported that more 
notes were exchanged by subjects under 
instructions to “cooperate” with fellow group 
members tan by subjects instructed to 

compete” on an experimental task. In still 
another relevant experiment by Cervi 

(1956), the “solidarity” ee 

y”? of three person 


y having stooges 


tons expressed by 
ent measures indi- 


topic supplied by the in 
Runkel (1962) has re 
tionship between the level of Tespect existin 
between secondary school teachers and fey 
often they communicate with one another; 
the causal direction of this relationship, 
however, is not clear, , 

An investigation by Mellinger (1956) has 
provided interesting data not on frequency 


Vestigators. F inally, 
ported finding a rela- 
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of communication but on the nature of com- 
munication as a function of interpersonal at- 
traction. It was found that scientists in 4 
research organization tended to conceal their 
attitudes about a particular issue when com 
municating with persons in whom they Jacke 
trust. 

In studies of small groups under ie 
to achieve uniformity on a given aac 
or problem, it has generally been found u 
communication tends to be directed more s- 
deviants than to other group members, E 
pecially in relatively cohesive or homogeneo 
groups (Emerson, 1954; Festinger e A 
1952; Festinger & Thibaut, 1951; Sehac miy 
1951). More recently, Altman and en 
nies (1960) have reported that both heis 
ened pace of discussion and frequency ly 
opposition-directed remarks were positive 
related to members’ valuations of ue 
groups. 

In the investigations cited in the fo 
paragraph, communication directed by , 
members to deviants consisted, primarily» 
attempts to influence a change of opinion: e 
is probable, then, that the relationship tion 
tween interpersonal liking and communic® 
will vary with the conditions under whic? 
interaction takes place, 


regoing 
group 
re gi 


Uniformity-Conformity 


Most of the findings support a a 
relationship between the degree 0" cons 
Personal attraction existing among a 
and their consequent uniformity ag l 


3 jue døm 
to particular opinions, attitudes, judg" 


or other behavior, Both attempts £0 influe 
and successful 


influence by, liked Po 
have been shown, These two processes: Á rat? 
appear logically and empirically to ° 
together, are usually not separated, pu pi 
Investigators haye obtained measures yea 
fluence attempts as well as of Te " 
changes in behavior, i 


ited 
(1951) is often © ures 


ere 
ns with regard to Pger 
rection, or group P 
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but whether the attempted variations in co- 
be Were successful is not clear. When 
os Partners were taken into account, no 
Snificant differences were obtained between 
Be ee and low cohesive pairs on a mea- 
altho, omge following influence attempts, 
reliably in high cohesive groups there was 
Partner. more change on the part of one 
er technique of manipulating group co- 
Btobable ee telling subjects that they will 
anothe y like (find congenial) or not like one 
invests, has been used in a large number of 
OMS ea of the attraction-uniformity 
es E ae Festinger et al. (1952) told mem- 
each Re Oe groups that they would find 
Sting er compatible, congenial, and inter- 
Vell, § and would most likely get along very 
Dressy uch groups were found to exert greater 
ith CS for uniformity of member opinion 
cussion d to a particular issue under dis- 
Orientati than groups given the opposite 
r ing Similar instructions were used by 
Workeq (1957) in a study where subjects 
an Econ simulated group conditions, 
Ma Ormity to the responses of partners 
late bg was found to be significantly re- 
ivesti liking of the partners. In an earlier 
itep ettton Berkowitz (1954) had related 
toup mber liking with the persistence of 
Menta "ductivity standards for an experi- 
Schacht. “SK similar to one utilized by 
(1951\° Ellertson, McBride, and Gregory 
i ) * » Mc. ? 
ù hip This latter study found that persons 
Mem er Cohesive groups followed presumed 
ag Suggestions for low productivity to 


if t 
barg Gp cantly greater extent than did mem- 
Pressure W Cohesive groups, but when the 


i : 
“es k Was for high production, cohesive- 
Other © no difference. 


liki Tesearch in which interpersonal 
ny S not been explicitly manipulated 
op ss Considered relevant to the present 

att lf we assume variation in the level 


ple, “ction among subjects. Thus, for ex- 
atte ” Ovard (1951b) found significantly 
h; ‘vergence in perceptual judgments 
ea amo Mbers of group-centered classes 
op se as ng members of leader-centered ones. 
ng tud © types of classes differed in amount 
A et interaction. Similarly, Festinger 
aut (1951) reported that among 
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group members told that they were with 
others similar to themselves in interest in 
and knowledge about a given problem there 
was greater change in the direction of opinion 
uniformity than among group members given 
the opposite instructions. Homogeneity was 
associated with increased uniformity only for 
opinions on a football strategy problem, how- 
ever, and not for those concerning treatment 
of a delinquent boy. In a study by Grossack 
(1954), some subjects were instructed to 
work competitively and others cooperatively. 
The findings provided support for the experi- 
mental hypothesis that cooperating indi- 
viduals (assumed by Grossack to manifest 
a cohesive group structure) will make more 
attempts to influence others and will accept 
more pressures in the direction of uniformity 
than competing persons, Kidd and Campbell 
(1955) varied the success of groups on an 
anagrams task and found, subsequently, that 
subjects in “success” groups differed signifi- 
cantly from those in “failure” groups in con- 
forming more to purported group averages 
regarding estimated light flickers. Finally, 
Dittes and Kelley (1956) have reported that 
group members who received ratings of aver- 
age acceptability from their fellows tended to 
show more conformity behavior than those 
who received low or very high ratings. 

Another group of studies have provided 
data on differences in conformity behavior 
between groups which more clearly differed 
in level of interpersonal attraction. Thrasher 
(1954) investigated variations in perceptual 
judgment as a function both of gradations 
in stimulus structure and interpersonal rela- 
tions. He found that the judgments of pairs 
of friends “were significantly more influenced 
by each other’s judgments than the judg- 
ments of pairs of individuals wo pe not 
so positively involved (neutrals). : ke 
the case for stimuli of both intermediate ai 
low structure value. From a quite differen 

i dy of secondary 

research setting, a field study A 
school teachers, Rasmussen and Zan er 
(1954) have reported that the ea 
between perceived group standards (regar e 
i i formance) and own ideal was 
ing teaching pert apr 
smaller, the more attracted was a teacher 
his group. From another field study, of 
decision-making conferences, it was concluded 
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that the achievement of high consensus is 
associated with warm and friendly personal 
feelings among the participants (Guetzkow 
& Gyr, 1954). Thibaut and Strickland 
(1956) compared conformity behavior within 
groups of positively associated individuals 
(fraternity pledges) and within groups of 
strangers. Under varying pressure of believing 
oneself to be in the minority with regard to 
judgments, the greatest positive relationship 
between pressure and conformity was found 
to occur in those “pledge” groups which had 
been given a “group” as opposed to a “task” 
orientation. A more recent investigation by 
Zander, Natsoulas, and Thomas (1960) re- 
ported that subjects who were most attracted 
to their group were also ones whose own 
aspiration level (or personal goal), with 
regard to an experimental task, was highly 
congruent with the goal of their group, In 
an investigation of natural student groups, 
Lott and Lott (1961) obtained a significant 
relationship between the degree of mutual 
positive attitudes existing among group mem- 
bers and their conformity to a purported 
group standard with respect to an opinion 
question. 

One experimental approach to the question 
of uniformity pressures and group cohesive- 
ness or of intermember attraction, has been 
the demonstration of less tolerance for, or 
greater rejection of, deviates in high cohesive 
groups. Schachter (1951) obtained data sup- 
porting this hypothesis in a study where co- 
hesiveness was manipulated through the use 
of differential Instructions and stooges ex- 
pressed points of view at variance with 


© parts of a student 
ed that in one there 


i » negative correlation 
between a sociometric measure of cohesive- 


ness and the percentage of deviates (with 
respect to a particular issue) in residence 
courts, and that deviates in these courts 
tended to receive fewer sociometric choices 
than those who shared the majority opinion. 
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In the other portion of the project, howevet, 
neither of these findings was obtained. fea 
Still another empirical variation has see 
the demonstration that individuals are ni 
resistant to changing their opinions if ere 
are shared by members of groups to W 5 
they are attracted. Gerard (1954), for E 
ample, reported that a paid participait ma 
challenged the opinions of subjects, ea¢ fec- 
whom represented a group, was more è at- 
tive in influencing those who were o an 
tracted to their fellow group members. 


t kart 
earlier investigation, Kelley and cae a 
(1952) had obtained similar data {10m 


natural setting. A sample of Boy hasis 
heard a speaker criticize the Scouts’ ETP hose 
on camping and woodcraft. In general, per- 
boys who valued their Scout troop on a 
ship least changed most in the direct! s 
negative attitudes toward these activities 
We have classed together a gront jr 
studies in which influencer (model) Pearly 
fluencee (imitator) are distinct and ats of 
identifiable. Taken together, the ae sup” 
these investigations provide considerabie ual 
Port for the proposition that an in p ge 
will tend to match his behavior to tha osse", 
liked other, In a laboratory study, T of 
Polansky, and Lippitt (1951) had P% of 
boys work together on block-designs- ried 
the boys, an experimental collaborator, Y% to 
his behavior with different partners 5°. Jow 
produce the conditions of high oe p 
friendliness. Under the former conditi mor? 
compared with the latter, there WaS 
imitation by the subjects of certain @ 
of the collaborator’s behavior during 92) 
periods, Lippitt, Polansky, and Rose? ` gt 4 
have also reported, from a field stu te m 
children’s camp which replicated ear por 
search, that the boys investigated were se ‘0 
likely to imitate the behavior of a hat 
whom they attributed high power a ate 
attributed power choices were highly J ant 
to judgments of personal liking. Harro : 
Consalvi (1960) had groups of boys f° pe” 
school for delinquents participate 7 ene, 
ceptual judgment task and found, in 2 or 
that the higher a boy’s status (i.e. th? eon 
Preferred he was as a teammate g whi 
captain), “the greater his proportion’ ot)” 
ence on members’ judgments . - *’ 
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vestigation of children at a summer camp 


(Cohn, Yee & Brown, 1961) obtained 
oe of food and activity preferences and 


ei otsata attraction. Over a 4-week 
Hs de a significant change in preferences 
most Ta in the direction of similarity with 
of en ked peers, In addition, a high degree 
eito ait was noted between actual 
attitud ehavior at meal time and measured 
evaluati, changes. Change in one’s self- 
Percei ‘on so as to match the characteristics 
reported, in one’s best friend has also been 
teshm, » among a sample of male college 
nen (Kipnis, 1961). 
stir Studies have shown imitation of a 
individua] Sr presumed previously neutral 
Crease TA under conditions designed to in- 
investi is attractiveness. For example, in one 
lege oe (Kelley & Woodruff, 1956) col- 
whig dents listened to a recorded speech 


Norm: 


n eudience heard applauding the speaker 
ollege. of faculty and alumni from the 
8e; others were informed that the ap- 

s ng audience was made up of anony- 
directio Siders, More opinion change in the 
der n advocated by the speaker took place 
tion t2€ former than under the latter condi- 
la d urdick and Burnes (1958) manipu- 
lectur another variable and found that a 
to after speaking pleasantly for an 
cha ie x group of students, was effective in 
pleas their opinions, while after speaking 
Rican antly on a later occasion, he was Sig- 
dents» less liked than previously and stu- 
irec Opinions tended to change in a 
urdik °PPosite to his. (In the same paper 
tive rel and Burnes reported finding a posi- 
Measure 2PShip between | emotionality, as 
isap by the galvanic skin response, 
ste; ement with a liked experimenter.) 
pa Stotland, and Zander (1961) have 
Cd that grade-school children tended to 
eir preferences regarding deep sea 

i , ore after an adult described as 
Similar to them in background and 
attributes than after a less similar 
n nder conditions where subjects were 
i larp med about the adult’s degree of 
Y to themselves, but were told about 
diving ability, preferences of the 
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model were more accepted by the children as 
their own when he was described as capable 
than when he was described otherwise. Sug- 
gestive data have been reported by McDavid 
(1959) from a study of imitation among 
preschool children: A female model was found 
to be initially more effective in providing 
behavioral cues than was a male model. 
Another and more recent investigation of 
nursery school children (Bandura & Huston, 
1961) varied the behavior of the female 
experimenter so that some of the children 
experienced two warm and nurturant play 
sessions, while the others experienced two 
cold, nonnurturant ones. There was more 
imitation of the experimenter’s verbal and 
motor behavior by children in the former 
condition (except for aggressive behavior 
which was readily imitated by all subjects), 
Although the empirical evidence favoring 
a positive relationship between interpersonal 
attraction and consequent behavior similarity 
is extremely impressive, there have been 
negative findings as well. Bovard, who had 
reported a positive relationship between 
interaction and convergence on perceptual 
judgments in one study (1951b), found in a 
later one (1953) that conformity on another 
perceptual judgment task was not reliably 
related to a measure of the average attraction 
of group members to their group. McKeachie 
(1954) compared group-centered with leader- 
centered classes and obtained equivocal re- 
sults with respect to congruence between indi- 
vidual attitudes and perceived and actual 
group norms. Downing (1958) has reported 
failure to obtain a significant cee a 
level of conformity between members al an 
and low cohesive groups mn an autokinetic 


nse situation. , ; i 
<> n among industrial work 


In a compariso { 
groups differing in cohesiveness, ag 
( i low groups di 

that high and g S 
found g A 


(1954) 


not differ reliably in within group var 


with respect to members’ ee ee 
reasonable production level, but the wi hin 
group variance in high cohesive groups was 
significantly less than in low groups with re- 
Je actual production. Burdick (cited by 
Walker & Heyns, 1962, pp. 30-40), has also 
reported both negative and positive findings 
under different conditions. High liking for 


spect to 
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other group members was found to be posi- 
tively related to nonconforming behavior in 
one experiment and to conforming behavior 
in a second experiment. In the latter case, 


conformity was made instrumental to being liked 
by other persons .. . by adding to the instructions 
the point that it is common knowledge that Persons 
tend to like one another more, to get along better, 
when they agree about common problems [p. 34]. 


Task Performance 


As was the case with Tespect to uniformity 
of behavior, there exists an extensive litera- 
ture devoted to the relationship between 
interpersonal attraction and productivity, 

One experimental technique has simply 
compared groups of friends (or positive socio- 
metric choices) with groups of nonfriends on 
some measure of productivity, An early study 
by Husband (1940) found superior perform- 
ance by pairs of sixth- and seventh-grade 
friends on word puzzles, jigsaw puzzles, and 
arithmetic problems, At three summer camps 
in which workgroups were formed according 


to choices made so that Congeniality was 
maximized, 


cussions - and more work accomplish 
plished 

[Faunce & Beegle, 1948].” Van Zelst 

(1952a, 1952b), experimenting in a natural 


n, grouped some Carpenters and 
into mutual choice pairs and 


Goodacre (1951, 1953), for example, reported 
significant positive correl 
ous indices of group 
scores made by combat and 


rifle squads on 
field problems, Similarly, 


Berkowitz (1956) 
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has reported a significant relationship i 
tween aircrew members’ attitudes toward o 
another and two criteria of their actu 
combat effectiveness; and Chapman a 
Campbell (1957), in a laboratory ra 
found that, “Ratings of a person’s desira dy 
ity as a teammate correlate significan is 
with the success of the teams of which ae 
a member.” Positive findings have am i 
reported by Darley, Gross, and D a 
(1952) from a study of women’s res! 956) 
units and by Gardner and Thompson (1 
from a study of fraternities, y n 
Supportive findings may also be cited i 
investigations in which the independent Y in 
able may be assumed to reflect variation 


5 suc 
interpersonal attraction; in soe: ee ae 
variation has been shown, explicitly, 


Ai or 
company the manipulation of conditions. P 
example, Sacks (1952) administered 5 
vidual intelligence tests to three grouP 
nursery school children. The tester nsbip 
Previously established a “good” relatio or” 
with the children in one group and a ae 
relationship with the children in @ att at 
group, and she had had no prior conta ores 
all with the remaining children. The. T the 
of the children who were familiar bt ha 
tester were significantly higher than thes ‘orm 
been 10 days earlier on an alternate for 
of the test whereas this was not the ca etest 
the control children; moreover, the ; 
increases of the “good” relationship CH | 
were significantly greater than the ine dine 
of the “baq” relationship children. ACCO 
to Sacks, qualitative observational da e the 
Port the fact that the children differed ” the 


intended way in their attitude tow! upe” 
experimenter, In a study of a chain 0! > p 
markets, Clark (cited by Homans, ! atio” 


255) obtained an almost perfect Cor"? on 
between an index of social ease (statu F 
Stuency of “bundler” and “ringer”) = 956) 
index of labor efficiency. Cervin (! ww? 
varied group “solidarity” by havina pion? 
stooges reinforce or extinguish the OP TH 
of the critical member of each group; ever 
Same subjects “showed better and ae 1 
performance” when in the solidarity u ilit 
the dissolidarity condition, The compa! s00? 
of Sroups, defined by similar interp?" wi 
orientations among group members 
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varied by Schutz (1955) and by Moos and 
Speisman (1962). Data from both studies 
show a tendency toward better task per- 
ormance by compatible groups. Schutz’s 
results also indicated that the difference be- 
tween compatible and incompatible groups 
Varied with the type of task. In an investiga- 
os by E. E. Smith (1957), group produc- 
; ity was shown to be negatively affected by 

© Presence of two silent members in five- 
Person experimental groups; the silent mem- 
ets were perceived as being unpredictable. 

That individuals who work together co- 
Pperatively are more productive on some cri- 
terion than group members who compete 
*sainst one another has been demonstrated 
for classroom groups by Deutsch (1960) and 
Y Smith, Madden, and Sobol (1957). Blau 
(1954), interested in the same variable, 
compared groups of employment agency 
Nterviewers and found, paradoxically, that 
Competitiveness and productivity were “in- 
versely related for groups but directly related 
or individuals in the competitive group.” 
The Competitive group was less productive 
than the group in which there were coopera- 
tive norms and in which competitive tenden- 
cles were curbed, but within the former 
Stoup, the more competitive individuals were 
relatively more productive. 

Not directly concerned with actual per- 
Ormance, but with individual aspiration 
levels for group achievement, is an investiga- 
tion by Ex (1959). Subjects who worked 
With a sympathetic partner (a congenial 
confederate) were found to have higher levels 

aspiration for their joint performance than 
Subjects who worked with an unsympathetic 
Partner, 

Not all of the research findings in this 
area have pointed to the same conclusion. 
Th an early study by Philp (1940), for ex- 
ample, in which pairs of kindergarten chil- 
Ten who preferred each other or were 
Strangers worked on simple tasks, the 
duality of the response made during the 
Wo types of pairings” was reported to be 
‘ty different, but there were no reliable 
ferences in average performance or range 

Performance. In a far different setting, a 

Study of actual conferences, cohesive 

“Oups were found to be no more productive 
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than less cohesive ones (Marquis et al., 
1951), and among industrial work groups a 
tendency was found for either high or low 
production among high cohesive groups, de- 
pending upon the group’s standard (Seashore, 
1954). That high cohesive groups can be in- 
duced to lower production or perform Poorly 
has been demonstrated in experiments by 
Schachter et al. (1951) and by Berkowitz 
(1954). 

Atthowe (1961), who measured the co- 
hesiveness of decision-making pairs by their 
number of “we” versus “I” remarks, also 
found no relationship between this measure 
and task performance, while Adams (1953) 
has reported a curvilinear relationship be- 
tween the degree of status congruence within 
bomber crews (which varied positively with 
friendship) and measures of their technical 
performance. It has also been suggested that 
under certain conditions there is a negative 
relationship between interpersonal liking and 
efficient task performance. According to 
Fiedler (1953), “quite in contrast to thera- 
peutic and social relations, we find that psy- 
chological closeness and warmth in key mem- 
bers of small groups seems to be a detriment 
to effective team work . . .” under conditions 
where teams “want to get a job done.” Simi- 
larly, Stogdill (1959) has suggested that “the 
effort that is devoted to the development of 
integration might be conceived as a subtrac- 
tion from the efforts that are devoted to pro- 
ductivity [p. 269].” Supporting data may be 
found in investigations of machine operators 
in a shoe factory (Horsfall & Arensberg, 
1949), of surveyor teams (Fiedler, 1954), and 
of rifle teams (McGrath, 1962). Hoffman 
(1959) and Hoffman and Maier (1961) have 
varied the homogeneity of members’ person- 
alities in problem-solving groups and found 
that nonhomogeneous groups tended to pro- 
duce superior and more inventive solutions. 
In these studies data on intermember liking 
were not obtained. 

Equivocal results have been reported by 
Yuker (1955), who used group recall as a 
measure of group productivity. In an inter- 
esting experiment, the leadership (demo- 
cratic versus authoritarian) and cooperation- 
competition dimensions of groups were varied, 
and individuals recalled alone and collec- 
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tively a story which had been read to them 
while they were assembled in their four- 
person groups. The difference in accuracy 
between group recall and initial individual 
recall was found to be significantly greater 
in cooperative than in competitive groups, 
but there was no difference between demo- 
cratic and authoritarian groups, and neither 
the cooperation-competition nor the leader- 
ship variable was found to affect differences 
in accuracy between final individual recall 
and initial individual recall. Yuker’s data, 
however, suggest that his groups did not 
differ reliably in cohesiveness. In a subsidiary 
analysis, Yuker found that the 10 groups in 
which there had been the greatest improve- 
ment from initial to final recall differed from 
the 10 groups in which there been the 
least improvement on three questionnaire 
items reflecting cohesiveness, 

Findings relevant to the relationship be- 
tween liking and task performance are con- 
tradictory. It seems likely that in a task 
situation other variables such as the demands 
of the situation itself (instructions or job 
specifications), the standards of performance 
preferred by liked co-workers, and the degree 
to which sociability may interfere with the 
required behavior for a particular job, may 
be highly significant. 


Learning 


We are not concerned here with the ques- 
tion of whether learning in a group situation 
is superior to learning when alon 
discussed often in the earlier ]j 
Lorge, Fox, Davitz, & Brenne 
more recently by Gurnee ( 
est is rather in those in 
compare groups differing 
attraction among members o; 
variable. Such studies have 
In a variety of learning situations and have 
yielded generally equivocal r 


neral esults, 
The major independent variable i 


by Perkins (1950) and Rehage (1951) was 
group- versus leader-centered classroom cli- 
mates. In the former, positive results were 
obtained with respect to the learning of child 
development material by in service teachers: 
in the latter, eighth-grade students under 
the “teacher-pupil planning” condition were 


e, a question 
terature (see 
r, 1958) and 
1962). Our inter- 
vestigations which 
in the level of 
r in some related 
been conducted 


n studies 
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found not to differ significantly from e 
under the “teacher planning” condition rs 
the “amount of social studies subject ee 
ter learned.” Calvin, Hoffman, and Har te 
(1957) investigated the effect of pine 
(permissive, democratic versus on ee 
sive, authoritarian) on group problem avie 
The trends in three experiments supP is 
their hypothesis that a permissive a 
superior to a traditional one for subje a 
with high intelligence, but a handicap 
subjects with only average intelligence. 
In a recent investigation by Shaw 1 0 
Shaw (1962) the sociometric composition j- 
three-member groups of second-grade their 
dren was varied. The children, iN an 
groups, studied lists of spelling words ati 
were tested during both an early 4? 
period of interaction. Spelling SCO i 
found to be positively correlated bore 
index of group cohesiveness or interme were 
liking only on the first test; they 
unrelated on the second test. pee? 
A number of relevant studies have = 


and 


and Sjoholm (1957) found a higher 
learning (for “mildly hostile” words) PY 4 py 
jects whose reinforcements were provia® 
an attractive, soft-spoken young la 
by subjects reinforced by a very mas 
former Marine captain, No data ye 
vided on the subjects’ attitude towa" 
two experimenters, On the other hand; piel 
tive findings have been reported by jonin 
and Karas (1959). Prior to the condit ested 
of “we” or “I” some subjects had bee? did 
by the experimenter and told that the? ome 
well, or poorly, or told nothing, while y 
subjects were given no prior experie” 

the experimenter. Those subjects wont? 
interacted with the experimenter condi 


ex 
pane ` n 
Significantly better than those in the jiabl? 
perience gro; 


: up but there were nO reond” 
differences among the former in rate A que’, 
tioning or final response level. Althous a jur? 
tionnaire results indicated that the ee 
Subjects considered the experimenter pes 
unfriendly and a source of discomfort igh! 
Subjects also said that they were more "" 
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Motivated on the conditioning task. Using a 
Similar design, Simkins (1961) obtained 
= cap results, Hostile words were reinforced 
oy Subjects had had prior experience with 
S S When the reinforcement 
or “the Lof the experimenter’s saying “good” 
those ats fine, the “failure” subjects (i.e., 
mec ee criticized by the experi- 
fe ) conditioned better than any other 
ad Subjects; when points or pennies 
e as reinforcers, conditioning was not 
Wei to the experimenter’s prior behavior. 
varied t Krasner, and Ullman (1960) also 
a hostil ue experimenter’s behavior to induce 
Critical € or a positive atmosphere prior to a 
ostile series of conditioning trials. The 
İvenes condition served to decrease respon- 
uite s but did not affect reconditioning. 
kata similar findings were reported by 
sped and Whitman (1962). In this study 
ents menter varied his expressive move- 
Succes, fant the conditioning procedure and 
i ully induced perceptions of “warmth” 
Under Yar by different subjects. Those 
nificant] e “warmth” condition produced sig- 
Plural y more words, but the number of 
Not aff nouns (which were reinforced) was 

“tected, 
T another kind of manipulation was 
Were mi by Sapolsky (1960). Some subjects 
con Old that they would be working with 
iven age experimenter, while others were 
the suc € opposite instructions. A measure of 
tive pess of the manipulation yielded posi- 
fou dings, and response acquisition was 
than i be significantly greater for high- 
tug, OY low-attraction subjects. In a second 
of sub, epolsky varied the “compatibility” 
ther, Ject-experimenter pairs by selecting 
Pun On the basis of scores on Schutz’s 
tati i ae Interpersonal Relations Orien- 
taine “Behavior questionnaire and again ob- 
the qa POSİtive results. These data provide 
“Ons; rarest support for the proposition under 

eration, 
report Must cite, finally, some unique findings 
Qeted | by Horwitz (1958) who has con 
fectg ¿2 Series of investigations on the ef- 
or ae the veridicality of one’s expressions 
der Mi or dislike, In one study, a teacher 
tj Ted to students’ desires under one con- 
n and was, therefore, power enhancing 


S 
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with respect to the students, while in another 
condition the teacher was power reducing and 
always acted upon his own desires. These 
manipulations, designed to produce liking or 
disliking of the teacher, had proved successful 
in previous investigations. Then, by manipu- 
lating information about group attitudes 
toward the teacher, either a veridical or non- 
veridical group norm was developed. Subse- 
quently, on a type-setting task, the number 
of trials required to achieve two consecutive 
perfect performances was found to be sig- 
nificantly lower (i.e., faster learning occurred) 
under the veridical than the nonveridical 
condition. In other words, whether the stu- 
dents liked or disliked the teacher did not 
matter so long as the group norm was the 
same as the attitudes which were generated 
by the experimental manipulation of the 
teacher’s behavior. 


Overview 


Compared with the empirical findings re- 
garding the antecedents of liking, those rele- 
vant to consequences are far less clear-cut 
and definitive. This is reflected in the relative 
lack of theoretical concern with the effects 
of attraction, as we shall see below. Since 
applications of so-called group behavior prin- 
ciples are often urged in such fields as group 
psychotherapy, education, and community 
relations, it is vital to distinguish between 
validated and unvalidated hypotheses or to 
differentiate between principles which have 
been relatively well substantiated by research 
and those for which the empirical evidence 
is weak or contradictory. In the former cate- 
gory might be placed generalizations concern- 
ing the effect of interpersonal attraction on 
evaluation of the situation, on evaluation or 
perception of persons who are liked, and on 
uniformity of behavior or opinions on certain 
issues, perhaps those that are most relevant 


to the personal relationship. Hypotheses link- 
e expression 


ing interpersonal liking with th s 
; communica- 


of aggression, self-evaluation, 5 c 
tion, task performance and learning require 
additional refinement. In need of systematic 
coordination within some general frame- 


work are all of the variables which we have 


considered here. 
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CONSEQUENCES OF LIKING: THEORETICAL 
EXPECTATIONS 


It is surprising to find that so little has 
been written of a systematic nature regarding 
the predictable effects of liking between indi- 
viduals or among members of a group. Fur- 
thermore, in much of what has been written 
it is difficult to identify underlying general 
principles; the predictions seem, for the most 
part, to be more in the form of empirical 
generalizations, and consequently, one theo- 
retical position cannot easily be distinguished 
from another. There are exceptions and some 


attempts at genuine derivation from sets of 
limited, broader principles, 


Festinger 


Festinger has not been concerned with 
interpersonal attraction, per se, but with a 
number of related social phenomena. He pre- 
dicted, for example (Festinger, 1950a), that 
as forces to remain within a group increase 
C.e., as the group’s attractiveness incr 
pressures to communicate will also i 
Social communication, according to F 
is a means by which individuals 
their opinions with one another 
validating them against the standard 
reality) and is a means by which in 
may more effectively Progress towar 
goal. Both of these communication: 
needs (or forces) are presumably 
high-cohesive or more attractiy 
Elsewhere, Festinger (1954) mor 
hypothesized that the greater the 
ness of a group, 


eases), 
ncrease, 
estinger, 
compare 
(thereby 
of social 
dividuals 
d a group 
-Provoking 
greater in 
e groups, 
e explicitly 
attractive- 
nt it is as 
respect to 
he amount 
n individual 
§ on him to 
> in other words, 
ractiveness of the 
tivities, 


the more importa: 
a standard of comparison with 


abilities and opinions, Further, t 
of opinion change on the part of a 
is a function of the forces actin 
remain in the group, or 
is a function of the att 
group, its members, or ac 


Heider 


On the basis of the balance concept which 
underlies his predictions of interpersonal 
behavior, Heider (1958) has proposed that 
when one person likes another the following 
consequences will be probable: (a) Similarity 


ALBERT J. LOTT AND BERNICE E. LOTT 


ith 
will increase between the two peoa a 
respect to their attitudes pata = from 
events, or other persons, mes ee a 
changes in the liker, changes in his P (b) The 
of the object of his liking, or iota ee 
liked person will be approached or afited of 
and the liked person will be err e 
helped and will be perceived as bene 
liker. 


Newcomb eciall y 


In Newcomb’s theoretical papers (esp! plicit 


: x 
1953) can be found considerable nso 
discussion of the effects of interp propo" 
liking. His “strain toward ae, greater 
sition leads to the prediction that t another 
the valence of one person (B) 1e con- 
(A), or “the more intense one Pe a jikeli- 
cern for another,” the greater 1S to 3 
hood that A will be “eae onment, 
“orientations to objects in the envir will be 
and that symmetry of orientations as the 
achieved. Perceived symmetry, aS ee a 
achievement of objective symmetry. s atti 
as a function both of “intensity ° 


: t 
n 
toward [issue] X and of attractio 
B.” 


owa" 


15 
; comb 
An important refinement in ane z 
position is represented by his hypo je 2 
there will be a greater range of su js $ ral? 
issues “with regard to which there more ie 
toward Symmetry” between persons pe 04 
tracted to one another than between $ wal 
less so, and for the latter ae iat a 
Symmetry is limited to those partic ire by 
Co-orientation toward which is aie Jat 
the Conditions of association.” In jn 4 i 
Paper, Newcomb (1960) proposed, Wit 
tion, that A will tend to endow vel 4 
“favorable attributes” if A is posit” ig P 
tracted or has a favorable attitude t0 


Thibaut and Kelley 59) 


19 
These theorists (Thibaut & Kelley» cone 
are concerned 

quences of on 
another, rather e 
tion to another, Although dependenc’ gely 
attraction are S 

correlated, th 
defined and r 
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cohesi. ; ; 
fonesiveness, for example, is said to result 
oa a condition of high interdependence 

ong group members, a condition in which 


e $ 
R member has high power over the other, 


can 
tee the other through a relatively great 
er of reward-cost units above [p. 114] 


th x 
XC other’s comparison level based on alterna- 


t =o 
thet Fes relationships. It follows, then, 


hi i 

4 pp hesiveness both the ability to make demands 

low © ability to resist them are greater than in 
cohesiveness [p. 114], 


teeing a potentiality for conflict under 
ever ormer condition. Cohesive groups, how- 
flict: ely find a procedure to avoid con- 
others ne member will have more power than 
issues | OF the members define a range of 
erib on which conformity is expected. When 
Means 4 have power over one another, this 
each that they have the ability to reward 
other; this ability 


Seems to 
rd 


Tewa Create further ability to do so... A will 


SAL f ìn order to get B to produce behavior that 
ditions nds rewarding, By so doing, A creates con- 

is ma dir Which, through association learning, 
The Bri ely to learn to like this behavior himself. 
learnin eater A’s ability to reward B, the more this 
8 will occur [p. 115]. 


thi. P 
his way, members of cohesive groups 


ov ‘ 
and toward similarity in values, attitudes, 
behavior. 


O¢ 
her Approaches 


mak Positive attitudes among persons de- 

in learning-theory terms as anticipatory 
ties re Onses having cue and drive proper- 
for’, Ott (1961) has predicted consequences 
È Stoup behavior derived from the Hullian 
the “Work. Since attitudes have drive value, 
the Sreater the degree of intermember liking 
Stoy 'gher should be the general level of 
the  ĉctivity, the quantity (not necessarily 
leyq Mality) of task production, and the 
the E communication., Similarly based on 
Preqi Ssumed drive value of attitudes is the 
take tion that more efficient learning should 
ot o Place in high than in low cohesive groups 
Dre, the part of individuals who are in the 
“ce of liked others. 
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It is also predicted that the “more cohesive 
the group the greater the probability that 
members will develop uniform opinions and 
other behaviors with respect to matters of 
consequence to the group.” This hypothesis 
stems from the fact that stimuli which evoke 
anticipatory responses are thus able to func- 
tion as secondary reinforcers since this prop- 
erty of a stimulus depends upon the stimulus 
being able to evoke an anticipation of reward. 
Persons who are liked, in other words, are 
sources of secondary reward. It can, there- 
fore, be expected that the greater the inter- 
personal attraction among persons, the greater 
is their reinforcing potential, and the greater 
will be “the probability of uniformity among 
them on issues where deviancy is accompanied 
by withdrawal of group members from the 
deviant.” Thus, uniformity is not always 
expected with respect to all issues. Lott and 
Lott (1961) have suggested further that a 
positive relationship between cohesiveness and 
conformity can be predicted 


only in those situations where the stimulus features 
are similar, to some degree, to those which were 
present when group members acquired their sec- 
ondary reinforcing characteristics. The principle of 
stimulus generalization, in other words, should be 
employed to predict the effectiveness with which 
group members can act as secondary reinforcers for 
one another in diverse situations [p. 411]. 


SOME GENERAL CONSIDERATIONS 


A number of years ago Albert (1953) 
raised provocative questions regarding the 
concept of group cohesiveness. He discussed 
the generality of its conceptual definition 
and the diverse and questionable attempts to 
operationalize it and asked, “is cohesiveness 
necessary?” We believe that at the present 
stage of development within the area of 
group behavior, we would do better to in- 
vestigate more clearly definable factors such 
as interpersonal attraction, establishing rela- 
tionships with antecedent and consequent 
variables. We have attempted here a begin- 
ning in this direction. Cartwright and Zander 
(1960) have suggested a similar approach: 


The most important task for group dynamics, is to 
establish a generally accepted set of basic variables 
and concepts having clear empirical and conceptual 


meaning. . . . One part of the problem is to iso- 
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late the actual unitary variables or dimensions that 
make discernible differences . . . The differentiation 
of unitary variables can only be accomplished by 
empirical work which discovers what regularities 
are invariably found among measurements and 
observations [p. 46 f.]. 


We do not intend to throw out group con- 
cepts and to substitute for them concepts 
definable in terms of the behavior of indi- 
viduals, but we believe it to be both more 
parsimonious and systematic to begin with 
the latter, our only source of data, after all, 
Motivation for group membership resides in 
the individual, and the consequences of group 
processes are seen in changes in individual 
behavior. Our thinking although primarily 
influenced by modern learning theory, is in 
accord with other contemporary positions 
within social psychology. Homans (1961), 
for example, maintains that “from the laws 
of individual behavior . . . follow the laws of 
social behavior when the complications of 
mutual reinforcement are taken into account 


[p. 30].” Floyd Allport’s (1962) theoretical 
orientation is similar: 


When the group dynamicist speaks of the “ 
tion of the group for the individual” 
mean just the attraction of the individ: 
another? If individuals are all drawn 


another, are they not ipso facto drawn t 
[Pp. 23-24]? 


attrac- 
does he not 
uals for one 
toward one 
o the group 


Allport has refined his early theoretical posi- 
tion which saw the explanation of social phe- 
nomena “only in the psychology of the indi- 
vidual as he operated in situations with others 


[p. 5]” to a view of groups as collective 
“structures” of articulated behaviors en- 
gaged in by interacting persons. In other 
words, what constitutes a group are “partic- 
ular give-and-take behaviors [p, 13]? 

A decade has Passed since Riecken and 
Homans’ ( 1954) excellent review of litera- 
ture relevant to the “Psychological aspects 
of social structure,” but the need for codi- 
fication of research findings, which they em- 
phasized, is still a pressing one. Our aim, too 
has been to place investigations under a 
“rationally ordered set of headings.” We also 
share with Riecken and Homans the assump- 
tion that “there are no such things as con- 
tradictory findings,” that such findings must 
have been reached in different circumstances 


ALBERT J. LOTT AND BERNICE E. LOTT 


and that the search to identify the specific 
characteristics of these circumstances 1$ a 
of the most potent sources of growth 
knowledge.” 
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REVERSAL LEARNING AND RESISTANCE TO EXTINCTION: 


A SUPPLEMENTARY REPORT 


SALLY E. SPERLING 


University of California, Riverside 


Several very recent and highly relevant articles provide substantiation for some 
hypotheses advanced in a previous review in an analysis of the overtraining 
reversal data. These same data permit the rejection of an hypothesis advanced 
by another reviewer to account for the results. It can now be shown fairly 
conclusively that amount of reward determines the direction of the ORE 
following position response training and that the variables affecting over- 
training reversal data with exteroceptive discriminative stimuli remain un- 


identified. 


In the time between acceptance and publi- 
cation of a previous literature review (Sper- 
ling, 1965), a paper by Paul (1965) on the 
overtraining reversal effect (ORE) was pub- 
lished. This article cited several relevant 
experiments that had not been discussed in 
the Sperling article and advanced a sug- 
gestion for accounting for the differences in 
results obtained in comparing the reversal 
performance of rats after training to criterion 
on the initial problem with that of rats given 
overtraining on the initial problem before 
reversal. During the same period three ex- 
tremely relevant papers were made available 
to the writer (D’Amato & Schiff, 1965; 
Theios & Blosser, 1965a, 1965b), all of which 
have now appeared in print. 

This accumulation of articles permitted 
the substantiation of some hypotheses ad- 
vanced in the analysis of the ORE data in 
Sperling (1965) and the rejection of the 
hypothesis advanced by Paul (1965) to ac- 
count for these data. These seemed sufficient 
reasons to relate these new data in this sup- 
plementary report to those data previously 
discussed. 

Additions to the Experiments Using Extero- 
ceptive Discriminative Stimul 
D’Amato and Schiff (1965) used light-in- 
tensity differences as discriminative stimuli 
in a series of eight experiments in an auto- 
mated Y maze. (This is the same apparatus 
in which D’Amato & Jagoda, 1961, observed 
the ORE.) They manipulated a substantial 
number of variables—degree of stimulus 
change from acquisition to reversal, intertrial 


us Training. 


ʻa of 
interval (ITI), amount of reward, sale ge 
rats, presence or absence of discrimina ec! 
during ITI, and variations designed tO bit- 
the rate of extinction of the initial hal 
and found no evidence of an ORE 
single studies or in the data of all 0 
combined. (It should perhaps be note ste 
the data on reversal learning using. id an 
ceptive discriminative stimuli show eith¢ the 
ORE or no evidence of an ORE. i not 
writer’s knowledge, a reverse ORE E 
been observed for this kind of training gio? 
The present writer shares the con ment” 
and frustration expressed by the expe" (2), 
ets; as in the case of Erlebacher 
there is no discernible reason for the 13 
an ORE in these data. The results re 
D'Amato and Schiff (1965) and Erle con 
(1963) experiments must lead to tPe he 
clusion that the ORE is not a genera, ai ue 
nomenon even for exteroceptive disct! com 
tive stimulus training in rats, The set te n 
ditions under which it can be exPe 
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ed to the Experiments Using Spatial 
(1956 Ex. Response Training. Both Pubols 
ll peng II) and Capaldi (1963) 
training a following position response 
in the 4 aps acquisition was quite rapid 
i ae Wo studies compared to other posi- 
their ae experiments. Subjects began 
in the fir ra run in an average of five trials 
Over ae study and were responding with 
the second ee choices after 16 trials in 
ever, how ee It is difficult to see, how- 
reversa] in would differentially affect the 
Baad Mie! suggested that amount of 
kind of he crucial variable affecting the 
Hllowing eee reversal data observed 
Was i aera response training. If, as 
e fee nesized, the relationship between 
in orcad Nee to extinction of the previously 
reward response and amount or magnitude 
Verse ORE. negative or nonmonotonic, & Te- 
finally an o no evidence of an ORE, and 
reward DRE would be predicted as amount 
© explain : This relationship seems 
nfort e above data. , 
Neither tite Pubols (1956) specified 
reinforce e size nor the weight of his “one 
tapiq Biome pellet,” but the comparably 
apaldi quisition suggests that it was, as In 
the tate (1963), relatively large. In at least 
Was he experiment, the amount of reward 
time) Se (judging from the 15-second access 
bsoy Which undoubtedly accounted for the 
va tion of the ORE. 
(196 a experiments by Theios and Blosser 
tiate p? 1965b) fairly conclusively substan- 
e wa hypothesis about the effect of amount 
SDonse al fi reversal following position re- 
“Ntiate raining, They used both a nondiffer- 
Ta that į maze and one less differentiated 
met expe in Kendler and Kimm (1964). In 
as obser timents in the two reports, an ORE 
oh Lence E for large reward groups, and no 
1 tved f an ORE or a reverse ORE was 
ate diffe in different small reward groups. 
Mo nts rence between the small and large 
Rea Gane reward in these experiments was 
ae effec ugh, therefore, to show both of the 
Sted j ts of variations in this variable sug- 
this n Sperling (1965). 
ES e Same variable also provides a tenta- 
planation of the observation of the 


te 


ORE in the 12-hour deprived group in 
Bruner, Mandler, O’Dowd, and Wallach 
(1958) and the lack of an ORE in the 36- 
hour group. If it can be assumed that the 
eating rate was no faster for the 36-hour 
than for the 12-hour deprived subjects during 
the fixed-access-to-food interval used as the 
reinforcer, then it is perhaps conceivable that 
the amount consumed constituted a larger 
reward on some dimension for the less de- 
prived than for the more deprived subjects 
since it perhaps brought them closer to satia- 
tion. Although these speculations are not 
testable in the data, they provide an account 
of them that is consistent with the effects of 
amount of reward, 

What constitutes a large or a small reward 
in any experimental situation is, of course, 
going to depend upon the deprivation level, 
experimental apparatus and response ae 
quired, training procedures, and perhaps other 
factors as well, but the effect of amount of 
reward, already observable at criterion per- 
formance on the initial problem (cf. Sperling, 
1965), seems without question to be further 
accentuated by increasing amounts of this 
kind of training. 

Comments on the Effects of Correction and 
Noncorrection Training Procedures. Paul 
(1965) suggested that the lack of agreement 
in results among the ORE data may be re- 
lated to differences in the experimental se- 
quence following an incorrect response. Ex- 
periments both showing and failing to show 
an ORE used noncorrection procedures, but 
difference in how the procedures 
st of the former studies 
an incorrect response led to removal from the 
apparatus without exposure to the unbaited 
goal box, while in most of the latter studies 


an incorrect response led to detention in the 
unbaited negative goal box. He speculated 
the experimental se- 


that this difference in 

quence following an incorrect response may 
account for the differences in results in the 
overtraining reversal data even though a few 


experiments reported an ORE when an incor- 
rect response led to an unbaited goal box. It 
can be demonstrated that this hypothesis 
does not aid in explaining the data following 
either position response training or training 
using exteroceptive discriminative stimuli 


there was a 
were applied. In mo; 
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alone; consequently, it cannot handle the 
combined results either. 

In position response training, Theios and 
Blosser (1965a) used noncorrection detention 
for 5 seconds on error trials and observed 
an ORE for large reward and no evidence of 
an ORE for small reward groups in the same 
experiment. In Theios and Blosser (1965b), 
both a standard correction procedure and 
noncorrection detention were used; an ORE 
was observed for large reward and no ORE 
or a reverse ORE for small reward groups. 
These two experiments, then, fairly conclu- 
sively show that the effect of overtraining on 
reversal learning in this situation is predicta- 
ble from a knowledge of the amount of re- 
ward irrespective of the use of either correc- 
tion or noncorrection procedures, 

The results of studies using exteroceptive 
discriminative stimulus situations also do not 
seem to depend upon this procedural varia- 
tion, Neither D’Amato and Schiff (1965) 
nor Erlebacher (1963) observed the ORE, 
and both used noncorrection detention in an 
unbaited goal box on incorrect trials, Among 
others, Birch, Ison, and Sperling (1960), 
Capaldi and Stevenson (1957), and Mackin- 
tosh (1962, 1963a, 1963b) also used this pro- 
cedure, and all observed the ORE. (The ex- 
periments by Mackintosh—1962, 1963a, 
1963b—are counted in the noncorrection de- 
tention studies since subjects were detained 


for 10 seconds in the Presence of the negative 
discriminandum on incorrect choice trials 
even though this detention was on a platform 
in front of the stimulus rather than on a plat- 
form behind it.) 


SUMMARY 


ning using ex- 
li in analyz- 
vertraining on 


the former 
there seems little doubt that the 


variable has now been isolated; 
instance, there seems little doub 
(or they have) not, and that 


instance, 
effective 
in the latter 
t that it has 
none of the 


SALLY E. SPERLING 


. : to be 
more obvious avenues of analysis seems 


leading to fruitful hypotheses. 
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SIGNIFICANCE OF ALL-OR-NONE LEARNING* 


FRANK RESTLE 


Indiana University 


The essential characteristi 

Es a characteristic of all-or-none data is that errors are recurrent events. 

aie = models having this property are derived from cognitive theory 
an theory, and stimulus-sampling theory. In addition, continuous-learning 


none special cases. 


models, including the linear model and Luce’s beta model, are shown to have all-or- 
3 experimental methods—the substitution method, the 


ESPETS experiment, and detailed distributional analysis of simple learning—have 
peed information on conditions that give all-or-none data. Multiple-stage models. 
of more than 1 all-or-none process, can be used to dissect a complex learning 


Process into elementary parts. 


Sin 

verbal o ock’s (1957) paper, a major issue in 
®SSociates , ning has been whether paired 
“autiful a e learned all-or-none or gradually. 
en erect pe of all-or-none learning have 
With sur e | and then smashed by iconoclasts 
5 Tl ae frequency, and now the issue 
i pea e both sides claim victory. 
aS serious} at no party to this controversy 
With an Sly attempted to define the issue 
intuitive generality or detail. It has seemed 
learning y clear what is meant by all-or-none 
been Tea the notion has mainly either 
(tos geet by experiment, as by Rock 
theories ieee within ongoing learning 

iy The ears 1960; Restle, 1962). | 
“|S that ex nt paper will try to determine what 
Biven the Xperiments like Rock’s would prove, 
Ybothesis Ri valid. What is the experimental 
he also oh all-or-none learning? This paper 
th ory mi empt to determine what property 
Cory. Pp ust have to be called an all-or-none 
‘i om this analysis it will become ap- 
at the idea of all-or-none learning 
aig ove hi Some experiments and the idea 
erent “ee experiments may be quite 
x 3 > Unreasonable to assume that all learn- 
Dn ne a Sorts and however defined, is all- 
hes is uninteresting to assert that there 
e, perhaps very special, experimental 
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procedure that yields all-or-none learning. 
The all-or-none hypothesis can be formulated 
with either a very broad or a very narrow 
scope, and of course the scope may also be 
discussed in more detail as to precisely which 
experiments will, and which will not, yield 
all-or-none data. Analysis of scope will show 
that the interesting experimental question is 
not “Is learning all-or-none?” whatever that 
means, but “What is the inner structure of a 
learning task, and how is that structure 


determined?” 
Att-or-NoNE DATA 


Three different experimental paradigms 
have been put forward as tests of the all-or- 
none hypothesis. 

1. Rock (1957) used paired-associates learn- 
ing, beginning with a large pool of items. A 
control group merely learned a typical list in 
the usual way, reading the list of paired as- 
sociates, then responding to the stimuli given 
alone. An experimental (replacement) group 
started with a similar list. The list of stimulus- 
response pairs was read, then a test (stimulus 
only) was given on the list. If any item was 
missed during the test, it was removed from 
the list and replaced with a fresh item. If the 
replacement group learns as rapidly as the 
control, this is evidence for all-or-none learning. 

2. Estes (1960) proposed a miniature ex- 
periment, consisting of one training or rein- 
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forced (R) trial of stimulus-response pairs, 
followed by two or more test (T) trials: the 
R-T-T experiment. If subjects who fail on Tı 
perform no better than chance on Tə, this is 
taken as evidence of all-or-none learning. 

3. Bower (1962) simply taught a list of items 
to a group of subjects, then compared the 
detailed statistics of the data with calculations 
from a mathematical model, the “‘one-element” 
model of stimulus-sampling theory. Tf the 
data agree with this model, it is said that 
all-or-none learning has been observed, 

These experiments all deal with acquisition 
and with items which subjects master com- 
pletely. Consider a sequence of training and 
test trials, all alike, and suppose the subject 
is to be scored as correct or wrong on each 
test. The all-or-none hypothesis, in each of the 
three experiments, asserts that unless the 
subject has completely learned the item, he 
has not learned it at all. Learning itself is not 
observable, but if the subject makes an error 
then he is known not to have learned, Rock’s 
experiment discards the item, presumably 
throwing away any partial learning. Estes 
predicts that if an error has be 


en made, per- 
formance on other tests will be at chance. 
Bower asserts that performance before any 


error is stationary, and that the future pros- 


pects following an error on Trial » are inde- 
pendent of n. 


Since errors are the im 
empirical theory of all- 
concentrate on errors, 
powerful definition comes 
notion of a recurrent event. Let e, 


Ína = P(enga|en) 
faa = Pengo and Cn] €n) 


Jri = Plens and epi; and: sand Cnyi]en). 


That is, fn, is the prob: 
error is on x + i, given 
If fn, is independent of 7 
an error is a recurrent ev 
of an error resets the pr 
error indicates that lear 


ability that the next 
an error on Trial 72, 
1, then it is said that 
ent. Each occurrence 
Ocess to its origin. An 
ning is incomplete, and 
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~ ` is in- 
since learning is all-or-none, if learning 1 
complete there is no learning and mep this 
is back at its origin. Mathematically, only 
definition had the advantage of referring Oa 
to an error and the next error, mee 
complex configurations of data. me 
given the fact that errors are recurren rable 
and the set of probabilities {fi}, apne i 
to calculate the probabilities of all in 
events? A 1963 

Bower (1962) and Suppes and Giñsbeig Oa 
have insisted that in an alorra di 
performance before solution must be 4 EST 
of Bernoulli trials. This would restric 5 are 
to the form f;= abi, where @ a 
probabilities. 


5 is 
arning 
lea ables 


namely, correct responses 
Successive trials. However, 
limited to acquisition of a simple eo 
to an analysis into correct me the 
Sponses. If we want to know W I 
underlying process of learning is “sition 
we must go beyond simple acqui ent 
periments and provide tests a where 
the training conditions. To determi stimul"! 
subject has learned, tests are made © c f 
generalization, memory, change © as use f 
and change of response set (such tion) 
recognition versus recall tests of rete js truly 
It may be argued that if gent pore 
all-or-none, all these different w ate 
show all-or-none behavior. But oa i ny 
necessary that there be no forgetting hat be 
forgetting be all-or-none? It see™ may ge 
requirement of all-or-none forgetting | nor 
different from the requirement of & inv" yo 
learning. Similar uncertainties are pta p 
in dealing with all other exper te My 
cedures other than simple acquisit!?; wg 
complete theory is needed to periere 
interpret more complicated and ! 
tests of all-or-none learning. 


; n psi 
? The argument for this point is based 0P jon? js 
and Supp 


3 es (1959) analysis of the ae e? of 
learning models, in which the basic samP "ign 3° 
composed of Sequences. Probabilities a r SÈ 
events, which are compounded from s “pute 
Sequences, ‘The Probability of any set a prov” ave’ 
qm tiguous trials can be calculated from t resting? 
i4 m a simple way, and from this all inte 
can be constructed. 


ALL-OR-NONE LEARNING 


THEORIES OF ALL-OR-NONE LEARNING 


Cognitive All-or-None Theory 


ae (1957) gave two accounts of all-or- 
results eet In one, he characterized the 
tition a a experiments by saying that repe- 
i lee to play no role. This formula- 
in Which ack to a kind of association theory 
actors ee repetition, and other such 
associata uce associations. In conventional 
o att theory, frequency plays the role 
the ee ng gradual increases of strength of 
ttpetition ne Rock’s first statement that 
i taming į plays no role merely says that 
Nother all-or-none, 

Heimer d discussions, Rock (1957), Rock and 
assert a $ 959), and Rock and Steinfeld (1963) 
em is et theory that a paired-associate 
device : a ly learned by forming a mnemonic 
i ay association.” If the subject forms 
Vell, and association he remembers the item 
tion” S if he fails to form a suitable associa- 
trial, Sains nothing and must try again next 


’ R » 

„a 4 Second theory is typically cognitive 
tion an (a) learning involves active incorpora- 
true s ima ading of the material, and (b) the 
A asii ulus and response may not correspond 
The Sa ble way to the physical identification. 
Psy gf me nonsense syllable takes different 

ack Bical forms, 
toy, a Seneral theory has a mathematical 
ilar to cognitive theories of cue learn- 
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Fic. 2. Graphical illustrations of the usual Hullian 


gradual-learning model and a Hiullian all-or-none 
learning model. 


ing (Restle, 1962). Let H be the set of possible 
associations, mnemonic devices, or “strate- 
gies,” corresponding to a particular pair of 
associates S-R. When shown S-R, the subject 
generates an association, that is, he samples 
from H, taking a Strategy A. If is satisfactory, 
that is, if it enables the subject to remember 
the correct response, then it is presumably 
kept until the next test trial. If Æ does not 
mediate successful memory of the response, 
the subject resamples from the Universe H. 
If the subject resamples always from the same 
Universe H, then all-or-none learning must 
result, for after every failure to master the 
problem the subject returns to an initial state 
of sampling from H. This indicates that errors 
are recurrent events, hence the theory yields 
all-or-none data. The process is illustrated in 


Figure 1. 


Hullian All-or-None Theory 


In Hullian theory the correct response will 
be made to a stimulus if its strength is above 
threshold and no other response is stronger. 
‘All measures of response result from the single 
response strength. Response strength oscillates 
randomly from trial to trial and ordinarily is 
incremented each trial of training. 

To specify the all-or-none version of Hullian 
theory, merely assume that the correct re- 
sponse can take only two strengths, one below 
threshold and the other above (see Figure 2). 

As habit strength shifts from subthreshold 
to maximum, the probability, latency, and 
amplitude of the response also must change 
from one value to a second, as must resistance 
to extinction and forgetting. Forgetting must 
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MEAN RESPONSE STRENGTH 


Fic. 3. All-or-none generalization gradient from 
Hullian all-or-none learning model. 


be all-or-none for a given item and must be 
independent of overtraining on that item. If 
response strength takes only two values, then 
the “generalization gradient” cannot be grad- 
ual but must be as in Figure 3. 

Furthermore, the various tests of response 
strength must all agree, except that they may 
set different thresholds. An example of the 
possibilities is shown in Figure 4, representing 
an easy recognition test, a recall test, and a 


difficult “quick recall” test. Notice that the 
subject gets correct responses before learning 
in the recognition test, not in either recall 
test; and the quick recall test is so difficult 


that the subject only gets about 60% correct 
after mastery. 


This Hullian all-or-none theory 
satisfied by any known experimental 


dures. It was put forth by Postman 
as an explication of the 


learning, along with con: 
disproof. 


is not 
proce- 
(1963) 
concept of all-or-none 
siderable experimental 
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Fic. 4. Illustration of how 
formance can be obtained fr 
model by changes of the test 
changes of response threshold. 


different levels of per- 
om Hullian all-or-none 
conditions and resulting 
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Stimulus-Sampling All-or-None Theory A 
In Estes’ stimulus-sampling theory. r3 E 
ulus is thought of as a set of stimulus ¢ tly on 
Each element is conditioned to ee 
response, and the probability of any „ditione 
equals the proportion of elements p condi- 
to it. Learning consists of all-or-no 
tioning of clements to responses. stimulus 
All-or-none learning arises P mo the 
has only one element. In Bower's onditionė 
theory, the element may be pa miss 
(perhaps, conditioned to some sgl 
response) so that the subject gU 
element becomes conditioned to ah 
response on any training trial wI k 
ilit Figure 5). mp!e* 
ý toa ae syllables a h 
visual figures it is difficult to believe Ar ie 
gives rise to a single element. Estes 
this observation with a ene pa 
Saying that the effective sirmas ie pate 
hence singular though complex. y di 


x: 4 t10! 
elements, how part-whole disri oy ne 
performed, and why learning is 7 variatio 
stimulus variability and irrelevan 

of the stimuli. 


oD! 
gM 
GRADUAL (LINEAR) LEARNIN 


noel 
Pio O 


opel 
ALL-oR-NONE LEARNING M 6 
i f 
TR if 
BEFORE TRAINING AFTER fg 
Fic. 5 aout Y” 


adu’ 
Stimulus-sampling models for 8" f four 


(top) and all-or-none learning (bottom 
items, 


r 


1 Sim 


ALL-OR-NONE LEARNING 


A diff i 
that yon of the simple pattern theory is 
cannot oe each stimulus as unique, hence 
similarity. F predictions of generalization or 
“Pattern pl riedman (1962) has discussed a 
the subject 4 components” theory in which 
makes the on to respond to patterns and 
hown Png response whenever the 
Stimulus i is presented. When a new 
0 response es is presented, the subject has 
omponents ( a new pattern and uses the 
is assumed fe atively numerous elements). 
t response, it hat if a pattern is conditioned to 
tesponse ae components also elicit the same 
bollows the is theory states that acquisition 
accounts m Lat Rone: patted theory and still 

Dies graded stimulus generalization. 

Dlete the ent. the writer has not seen a com- 
Ships of Oretical account of the interrelation- 
the ea subpatterns, and elements. 
en a Rats g situation, if it is possible, arises 
ad its ol ern is conditioned to one response 
ous one to another. Now suppose that 
Presented Situation very like the pattern is 
Pattern, one It is recognized as the same 
Ms be a new sponse is made, but if it is taken 
hoar then pattern (although perhaps very 
Po eines response is by components and 
lin the ponsa ils made. It is difficult to 
© between precise nature of this dividing 
another presentation of an old 


Atter 
in and the pr p e: 
Sw beter: the presentation of a very similar 


isag 


u, 
ory Models 


ala pat 
Kay atten d-associate item is tested immedi- 
Ow. S presentation it is almost always 
Ny | Spang & Peterson, 1961). This 
yp tate a led to theories of paired- 
the Ect a P e which suppose that the 
tor test, OF s learns, but then forgets before 
moe tting let if the subject just keeps 
i ~~ etween tests he never improves 
’Ster t the first test and certainly cannot 
le One , © Problem. , 
tae ae ae is to suppose that the item 
ite, Pe reise A storage” from which it 
fing Can eit In temporary storage the 
by Tate or forgotten unless it 1S 
Aydin gaan to “permanent storage.” 
e item a Gelfand (1964) propose that 
cessful] permanently stored if it is recalled 
Y at a retraining trial. This theory 
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would lead to very good memory after two 
immediately successive presentations, for then 
bag . el heed likely to remember 

, and thereby fix the 
memory. Peterson, Saltzman, Hillner, and 
Land (1962) and Peterson, Hillner, and Saltz- 
man (1962) have found instead that a rela 
tively wide spacing of training trials is better 
and Greeno (1964) has shown that there 5 
almost no new learning on the second im- 
mediate presentation of an item. There seems 
to be a special virtue in remembering after a 
long pause—and the immediate memory model 
of Friedman and Gelfand does not handle 
this fact. 

Theories with several levels of memory can 
lead to all-or-none learning or to two-stage 
models (Waugh & Smith, 1962). The major 
theoretical problem is to decide what levels of 
storage are possible and under what conditions 
an item transfers from one level to another. 

The cognitive strategy-selection theory men- 
tioned above is a memory model. When the 
subject selects a satisfactory strategy, the 
item is put in what memory theories call 
“permanent storage.” Typically, memory 
strategies in paired-associates learning involve 
selection of a cue from the stimulus. If the cue 
selected is not unique to that stimulus, the 
subject will make generalization errors, then 
resample a new strategy. In memory terms, 
this says that the item was in temporary stor- 
age and was erased by a new item. If the sub- 
ject’s strategy is unsatisfactory for other 
reasons, memory theory might say that the 
subject has difficulty gaining access to the 
the item, or that the memory dis- 
he strategy-selection theory has 
nt levels of storage, and these 
well as quantita- 
be unsatisfactory 


engram of 
sipates. T 
many differe 
levels differ qualitatively as 
tively, since a strategy may 
in many different ways. 


GRADUAL LEARNING MODELS WHICH PRODUCE 
AtL-or-NONE DATA 


ng models postulate a trans- 


Several learnir 
robabilities or under- 


formation on response p 
lying states or strengths. Any of several trans- 
formations is possible, depending upon the 
response of the subject and the outcomes of 
reinforcements. Within each of these general 
models, it is possible to set up a special “re- 


318 FRANK RESTLE 


: e 
BR « ERROR MADE Since this is not a geometric sequen fe 
model predicts some “nonrandomness ower’ 

presolution period,” if analyzed using Pa A 

/ methods. More specifically, runs of i 
will tend to be somewhat longer than pr of an 

by the Bower model. The probability © an 
infinite string of correct responses See 


ERROR MADE 


PROBABILITY CORRECT 


+ ag) is 
= d50 all-or-none model is taken for learning) } 
P= .20 
k i(i- p] 
| en fe = TL Tt- a— 
i=0 
. g «te positive | 
ae Hn Sy an infinite product. This function 18 r such, 
1 28S BS ee oN for reasonable values of the paramett "(Get | 
TRIALS as, for example, po = 50, 9 = ho give ay) 

Fic. 6. Illustration of theoretical underlying proba- Bush & Mosteller, 1955, p- 160, waa ta 
bilities of correct response for a subject who makes  g; 4 
errors only on Trials 4 and 6. (Assumptions are those eee beg values an deduce, for m 
of a linear model in which the probability resets to Po From this model one can ill have w 
at any error.) ample, that the total errors (T) m 

distribution 
current events model” satisfying our definition k 
or all-or-none learning. PT =k) = 0- h 2)'fo 
Linear All-or-None Model where fa is as given above. 

The linear model (Bush & Mosteller, 1955) AlLor-None Beta Model A 
writes linear operators which operate on p, t respos 
the probability of a certain response. In fixed. Luce (1959) has proposed that pe F 
point form such operators are written: strengths, nonnegative variables n genero 

signed to the ive responses. “Sf v; 
Op = (1— Dp + ON g alternative resp of % 


after a trial of type k, the new vala 
where @ is learning rate and } is as 


response 7 is given by 

ra ymptote. 8 y 

Let p be the probability of a correct response = Built) 

and the two operators be Qr, applied when TLA i 


: 1 resp? è 
e a aie akes an error, and Qo, applied for some fiy > 0, depending Or i sup?" 
1l €s a correct response. To obtain nd outcome. For a simple exaMP™ js H% 
an all-or-none model, that vg, the error-producing tendenc)” de, p 
Qo(P) = 0 — op +0 maT O e 
Qr(p) = po. í 
7 ee response increases p toward unity, ohh tid 
ut an error returns $ to its initi 3 
(See Figure 6) p4 itial value pọ After an error, 
The general linear model dis i I 
: plays independ- velt + 1) = 2 ue 
ence of path; given a level of ?, nothing de- x i epe 


t 1 
r 3 . ren” pe, 
pends upon how it got there. Since errors set 3 i gui The emopisnow A "eraus teat 
p at po, errors are recurrent events in this seh Tns (be tgo eae ae 
model. The f distribution is 


original values of 1 and vo. The f dist 


fi= A- = — po) Bis miig 
x To =- 0—0 — po). suf (1 + Bio) d 


: priv? 
and the probability of an infinit? 
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Successes is 
a= 7 Bt 
i (1 + 28") ` 


i=0 


Similar 
Pet results can be obtained with the 
(1956) wie model of Audley and Jonkheere 
iie m Aa TE author’s discrimination learn- 
continuous: estle, 1955), and almost all other 
Pedientsof a by the simple ex- 
aud wron a) defining the responses as correct 
initial y E and (b) resetting the model at its 
alues whenever an error occurs. 


Aor. 
ee Hullian Model with Gradual 


Ab 
ee ie Hallan notion of all-or-none 
kes onl s Mi in which response strength 
obtain allo wo values. It is also possible to 
earning, a data despite having gradual 
(a Sicilian simple model works as follows: 
ave Eunos has zero variance; (b) items 
Me fixed rm distribution from threshold to 
Uuring traj value m below threshold; and (¢) 
tituous ene each item climbs by a con- 
Value, y @XPonential. function from its initial 
in Figure ri value m + x. The model is shown 
ine-half) at A fixed proportion (in Figure 7, 
N SicEoss Mee previously below-threshold 
pve Ro kn reshold each trial. Presuming we 
[Ponse < owledge of the preexperimental 
Ok exact} tength of an item, the data will 
y like all-or-none learning. This is a 
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ty o 
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K ASYMPTOTE OF LEARNING 


+ 


DENSITY f(x) 
p p A 
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ENSITY f(x) 


RELATIVE EXCITATORY STRENGTH (X) 


Fic. 8. A second Hullian model yielding data which 
appear to reflect all-or-none learning although the 
underlying processes are of gradual learning. (In this 
model response strengths all converge to a single 
asymptote with extended training.) 


refinement of the interpretation of Under- 
wood and Keppel (1962). 

Another Hullian model has a single asymp- 
tote for all items, and a common learning rate. 
All items have zero oscillation. Assign value of 
0 to threshold and value of 1 to the asymptote 
of response strength, and suppose that each 
item moves a fraction @ of the way toward 
asymptote from its initial value. The whole 
distribution is compressed by a factor of 1 — 0 
each trial. To have all-or-none learning, & 
constant fraction must Cross threshold each 
trial, and this happens if the cumulative dis- 
tribution function is 


® (x) = etha 


where 8 =— loge(1 — 6) and x ranges from 
— œ to 0. Figure 8 gives a general idea of the 
shape of the original density function given by 


ola) = pri”. 

ullian model says that all 
items will be equally strong after extensive 
training, whereas the first model preserves the 
original distribution. Both produce perfectly 
all-or-none data at the phenotypical level, 
though both models are genotypically gradual 


learning models. 
Tn summary, all-or-none learning can arise 


This second H 
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from versions of cognitive theory, Hullian 
theory, stimulus-sampling theory, and as a 
special case of linear learning models, Luce’s 
beta model, and the usual Hullian gradual- 
learning model. 


EXPERIMENTAL FINDINGS 


The three main methods of experimenting 
are (a) the Rock replacement experiment and 
variations, (b) the Estes miniature experiment, 
and (c) ordinary learning with detailed ana- 
lyses. Of these, the Rock technique, which 
involves a clever experimental innovation, has 
received the closest inspection. The results 
as follows: Using the letters e 
and doubly (AA, BB, etc.) as s 
numerals 1-50 as responses, with slow training 
(up to about 8 seconds per presentation, about 
30 seconds between trials), everyone gets 
Rock’s all-or-none result, Mastery of a list of 
12 items takes four to five 
letter-number p 
(Underwood, Ri 
the results may 
the more difficult items will 


are 
xcept I singly 
timuli, and the 


of such selection are not yet worked out in 


1962), 


Rock (1957) also used nonsense syllables as 
stimuli and responses, in his second experi- 
ment. Even with very slow training these 
much more difficult: Rock’s subjects, learning 
a list of eight Pairs, took an average of over 
eight trials, and made about 3,2 errors per 
Item. Rock and Heimer (1959), using non- 
sense syllables, found no difference between 
repeated (wrong) and 


new items, an indica- 
tion of all-or-none learning, However, other 
experiments using about the Same technique 


have not all obtained all-or-none results : Lock- 
head (1961) with slow training obtained 
none results, as did Bolles in a serial-le 
task in which he scramble 
in the middle of the list after the list was 
partly learned. However, Battig (1962, Ex- 
periment 1) found a difference of 4.8 (experi- 
mental) versus 3.7 (control) errors per item, 
favoring repeated items, Brackett and Battig 


are 


all-or- 
arning 
d or replaced items 
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= 5 of 5l 
(1963, Experiment 1) found differences 


: son (1961) 
versus 2.7 errors per item. Kristofferson (Y or 
found differences of 11.4 versus 8.5 experi: 
a list of 15 CVCs, then repeated the 
ment with only 12 pairs and got TF rapi 
of 12.4 and 8.2. Lockhead (1961), w! found 
training (0.75-second duration of items), 


ate 
2 in favor of repe 
a very large difference in favor of rep 


als ° 
: ean tria 
items. Postman (1962) found mean com 


as 
11.5 to 9.0, then a median of over a items: 
pared with about 9 trials, in lists of re vers 
Rock and Steinfeld (1963) found ight items: 
6.0 trials to criterion, with a list of En 
and mean errors per item of 4.4-3.0, 
subjects were required to spell ae 
Clark, Lansford, and Dallenbach Iso S! 
peated Rock’s (1957) results and a seat io! 
that with rapid (1 second) pres 
learning was not all-or-none. 

These results indicate: (a) Noni 
lables are more difficult to eaa a ie 
letter-number pairs (at least twice al 
trials to criterion, usually more): 
or-none finding, though repeatable. 


"hay in > 
letter-number pairs (and possibly A repe! 


whe 
sense 


by 
tsp 


LS 

ene! 
value iy 
remt” 


gratio 
5 nse intep, par 
the item may involve respo sve 


x. h K re" 
Furthermore, a list of eight CVC el, iż 
g i 


ects caused by stimulus yn ase" 
between formally similar nonsens? es o ý 
used as stimuli, even with one-digit t ust 
Hence, learning of CVC-CVC p% and 
requires some response integration ©". ji 


P . k San agents ant 
Selective stimulus discrimination, ¢ 


ry J, e! 
to produce multiple-stage learning et e 
perimental d; 


ata are on the borderlin® ing 
all-or-none and gradual learning, jt 
Rock test, n 
Some other materials have cage mis? ioe 
varying results, Battig (1962), in tems, p! 
esign with a total list of 12 1 enable 
nonsense forms said to be discrimi 


ALL-OR-NONE LEARNING 


with low association values. Learning was 
rapid (2.7-3.4 errors per item, about six trials), 
E results were not all-or-none. Kintsch 
( 963) used two-digit numbers as stimuli and 
hes as responses, and found two-stage 
sire He used the valuable innovation of 
method $ the control group’s data by Bower’s 
ing rhe ; inding a two-stage model, then show- 
vied at the same model agreed with Rock’s 
Be imental results. Rock and Heimer (1959) 
ie first names as stimuli and re- 
of oon two Studies, along with innovations 
ntere “ie including use of a matching test. 
Meeting eT though the responses are in- 
se 5 and the stimuli presumably dissimilar, 
three ound all-or-none learning only in one of 
Cate: Fi sna It may be that the obvious 
su > ries (especially boy versus girl) lead 
in “Jects to classify these materials and learn 
* multistage manner. 
fe and miniature experiment, the basic 
han ni a training trial (R) followed by more 
none ag test (T-T). The symptom of all-or- 
test eung is a high proportion correct on 
import (T2) given correct on Tı and (more 
iven ant) the probability of correct on T: 
PC Not correct on Tı; the probability 
Ni) should be zero or chance. 
Nae” are essentially the results of Estes’ 
“roth (Estes, 1961; Estes, Hopkins, & 
Ceo S 1960) in two R-T-T studies using 
te pa tli and common English words as 
mm, Ses, and in two free-recall studies using 
the fone English words. The probabilities of 
ur outcomes are shown in Table 1. 
in des Probability is supposed to fall mainly 
(the mae cells: CC (the subject learned), NN 
Subject ect did not learn), and CN (the 
*SSociat learned but forgot). In the paired- 
Coup es learning, some guessing also may 


wine (1962) repeated the above studies 
ky Ae ee as stimulus and a numeral, 
€sponse. The results were: 
P(CC) = .398 
P(CN) = .133 
P(NC) = .089 
P(NN) = .380 


Whiap = 
ch Senerally agrees with Estes’ result but 
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TABLE 1 


PROPORTIONS OF OUTCOMES OBSERVED IN Four R-T-T 
Srvpres REPORTED By Estes (1960, 1961) 


Halted epocaten: Free recall of words 


‘CC-word 
Exp. I Exp. II Exp. III Exp. IV 
P(CC) .238 35 292 151 
P(CN) 147 14 -098 129 
P(NC) 017 -05 -003 014 
P(NN) -598 46 -606 -705 


shows faster learning and a high frequency of 
NC events. Jones delayed the second test. 
Jones also did an R-T-T-T-T experiment, 
saying that if a subject is wrong on T, he 
must not have learned, hence must do badly 
throughout the tests (if the all-or-none prin- 
ciple is to hold). Let C; mean a correct response 
on Test Trial i, and N; mean an error on 
Trial i. The all-or-none assumption implies 


(with 7 > 1) that 
P(C;|Ni) = k 


where & is some constant guessing probability. 
A severe test of this hypothesis is to consider 
items missed on the first test but responded 
to correctly on some other tests; by theory, if 
the subject guesses wrong at the first test and 
correctly on Tests 2 and 3, he still has only a 
chance probability of getting a correct answer 
at Test 4. Jones found that this theoretical 
expectation failed : 

P(C2|Ni) = .29 

P(Cs|Ni, C2) = .61 

P(Cs|Ni, Ce, C3) = .77. 


There are two possible explanations of this 
finding. One is that, contrary to the simple 
all-or-none theory, the items do differ in proba- 
bility of correct response. Hence, most items 
with the response profile NiC:Cs have a rela- 
tively high response probability, whereas those 
with a known profile of only Ni have a lower 
probability. A second hypothesis is that learn- 
ing occurs during test. It might be that when- 
ever the subject responds correctly during test 
he reinforces the correct response, and there- 
fore possibly might shift from the guessing to 
the learned state during the Cs and C; trials. 
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This second hypothesis is compatible with the 
idea that learning is all or none. If the result 
was due to different levels of probability, the 
patterns N, NC, and NCC merely single out 
different subsets of subjects and therefore 
produce the different conditional probabilities 
observed. The calculation can be run back- 
wards with precisely the same effect ; that is, 
one can single out items in which the last 
(rather than the first) response is wrong, and 
then make the parallel conditional proba- 
bilities. The result of this calculation, carried 
out from Jones’ data, is as follows: 


P(C,|Ns) = .25 
P(C2|Ng, Cs) = 45 
P(Cy|Na, Cs, C2) = .67. 


This shift in probabilities cannot, of course, be 
attributed to learning since it progresses in 
the wrong direction. Postman (1963) and 
Schwartz (1963) have done more complex ex- 
periments involving both several reinforced 
trials and several tests and also have obtained 
results not in accord with the simple all- 
none hypothesis. 

The Rock experiment tests whether one 
failure equals zero training, and the R-T-T 
experiment deals with a single training trial. 
Following techniques of Bush and Mosteller 
(1959), Bower (1961) worked out many con- 
Sequences of an all-or-none model and then 
compared theory and experimental results in 
great detail. One study used 10 consonant 
pairs as stimuli, the numerals 1 and 2 as 
Tesponses, and self-paced training by flash 
card. Verbal discrimination (Bower, 1962) has 


yielded similar results, Numerous studies, 
many unpublished, have found that with dis- 
crete stimuli and two sim 


ple responses, all-or- 
none data can readily be obtained 3 


or- 


3 The writer has devised a classroom demonstration 
of all-or-none learning. Stimuli, read orally to a class, 
are two-syllable paralogs such as JEDIN, NABOG, etc. 
Responses are check or plus, Very good fits to the Bower 
model are obtained. However, recently one subject 
remarked in class that his method had been to learn 
to recognize all the words going with plus. All others 
would be called check. Thus this two-choice paired- 
associates problem was learned as a simple recognition 
task. Approximately 200 of 230 Students in the class 
indicated that they had u 


: used similar Strategies. This 
means that two-choice paired-ass 


ce p ociates learning may 
not involve any association learning at all. 
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basso 
Using Restle’s (1962) model, Tra 


ing 
(1961, 1963) demonstrated ier iis 
in concept formation, and Bower its using 
basso (1964) have extended the nduding 
a variety of experimental methods 
several related to Rock’s technique. 


EARNING 
COMPOUNDS OF ALL-oR-NoNnE LEARN 


One alternative to the all-or-none nypa 
is that learning proceeds in Smeri 
stages are all-or-none. Atkinson a in 
(1963) reported eight experimen del g 
which a two-stage all-or-none a the sta 
good fit to the data. In the model, jate 
are So (prelearning), Sı (an IDEN ar 
of performance), and S2 (solution). 
tions are shown in Figure 9. -ain mode! t° 

Kintsch (1963) employed a simila? S petitu- 
predict the outcome of a Roce ett (1982) 
tion experiment. Waugh and Sn vera pro 
analyzed free verbal recall with Sring i 
esses—labeling, processing, and $ 
model like that in Figure 9. ical ate 

Restle (1964), in a long theoreti pa ic 
discussed several experiments, sblishe 
one by Bower and Levine (unpu! the BOW 
another by Restle and Trabasso. In was 
Levine experiment, each stimulus on 
Korean characters. Nine stimuli, each, 
in three subsets of three stimuli ree stim, 
given in a list. In one subset, the other su 
had no elements in common. In a? char ali 
Set, the three stimuli had one age stim 
in common. In a third subset, the th 15 00 


ave 4 
tes 


PA bu 
stimuli with no common elements, th ov 
two-stage learning for items W! 


Jy 
vyer? a 
Fis. 9. Two-stage model employed by mone K 

to account for data in which simple all-0 
4s not exhibited, 


ALL-OR-NONE LEARNING 


The second stage was identified as discrimina- 
=e learning, and it was ascertained that the 
2 ess errors were stimulus-generalization 
trors, The model is that shown in Figure 9. 
ide and Trabasso’s experiment used 
oen syllables, some paired as formally 
eae (vem, vum) and others unique. Re- 
Selo were numbers 1-8 or 1—4. Detailed 
em lg showed strong stimulus generalization, 
P neoe stimulus generalization be- 
via) syllables with the same vowel (rw, 
hii apparent response generalization, and, 
The & guessing, strong response preferences. 
Eaa ipn was that one-, twos, three-, or 
and d our-stage learning could be picked apart 
emonstrated. (See also McGuire, 1961.) 
gean, Restle, and Polson (1965) used 
respons: as stimuli and common nouns as 
Wer ee Some pairs of pictures in the list of 
o simi, twinned: 2 oak leaves, 2 quadrilaterals 
in the T shape, etc. Other stimuli were unique 
earned ist. Items with unique stimuli were 
items y an almost all-or-none fashion, whereas 
ages, with twinned stimuli were learned in two 
stimula a the second stage was identified as 
that te ‘crimination. Since it was assumed 
Selectig mulus discrimination depended on 
Pendent of a differential cue, an event inde- 
the mod of forming the original association, 
Dene derived was that of Figure 9. 
Tespons ers (1964) taught subjects compound 
two-di es in paired associates; responses were 
kianga oe like “red square” or “green 
Tesponse He showed that mastery of such 
Parts, Til was a compound of mastery of the 
m whi SN Iner (1965) taught paired associates 
Arelated the responses were strings of three 
U the q common words, (BANK-YARN-MILK). 
intro, Second and third words were each 
the — only after the previous segment of 
‘bon s was mastered, the data corre- 
of pi ed closely with an extention of the model 
Spo, == 9, but if the whole compound re- 
Veale; êS introduced at once, the data re- 
Made a strong correlation between errors 
tepon, > the various components of the 
Onse, 
model of these studies, the multiple-stage 
data Va8 evaluated by its ability to fit the 
lg’? OUt the assumptions of the model were 
Meth og uated by traditional experimental 
S of manipulating experimental vari- 
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ables and classifying the types of errors ob- 
served. Conclusions were properly based only 
on the quantitative agreement of converging, 
independent operations. = 


Discussion 


Rock emphasized the importance of mne- 
monic devices among which the subject would 
sample in solving the problem of remembering. 
Estes employed the Gestalt idea that the 
stimulus would act as a united pattern. Restle 
reintroduced the idea of selection of hy- 
potheses. These theories are usually set in 
opposition to pure associationist theory in 
which learning is a more passive process re- 
sulting from conjunction of stimulus, response, 
and perhaps reinforcement, in which the typ- 
ical stimulus is analyzed into small elements 
and in which learning is the formation of simple 
habits rather than the selection among more 
complex strategies. 

All continuous-learning (associationist) the- 
ories can lead to all-or-none data, and the 
various cognitive theories of all-or-none learn- 
ing differ among themselves in important 
respects. Rock’s mnemonic devices may lead 
to associative connection rather than selection 
of hypotheses and do not require that the 
stimulus be a pattern. Estes’ pattern theory 
can be a standard passive-association theory, 
requires no complicated strategies, and is more 
association than selection. Restle’s strategy- 
selection theory makes no essential use of 
mnemonic devices, and does not assume that 
the stimulus is organized. 

Since several different association theories, 
and several different cognitive theories, all 
have the capacity to explain _all-or-none 
learning, the simple fact of all-or-none learning 
is not appropriate for the decisive confronta- 
tion of major theoretical approaches. No 
classical theoretical issue can be decided by 
answering whether learning 1S all or none. 

Logically, there are several versions of the 
all-or-none hypothesis, including: (a) All 
learning is all-or-none ; (b) There exists at least 
one task that is learned all-or-none ; () A 
single item of a paired-associate list is learned 
all-or-none; (d) A single item of a paired- 
associate list, provided the list involves no 
stimulus confusion, response integration, etc., 
may be learned all-or-none; (e) Concept 
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formation, discrimination learning, and certain 
other tasks are learned all-or-none; (f) Any 
learning task that involves only one difficulty 
is learned all-or-none. 

Of these, a is certainly false, and c and e 
are no doubt insufficiently qualified. Version b, 
the weakest, is true at least to an excellent 
approximation. Version d is the one most 
often intended in the experimental literature, 
and the experimental debate centers around 
whether Rock’s experiment, or Estes’, actu- 
ally is learned all-or-none, and whether these 
and other experimenters have succeeded in re- 
moving all but one source of difficulty, usu- 
ally the “association phase.” Recent work by 
Trabasso (1961, 1963) and Bower and Tra- 
basso (1964) supports Version e, with certain 
restrictions that are not yet well specified. 

The strongest possible interpretation of all- 
or-none learning is Version f. This is not 
merely an assertion that some example of 
all-or-none learning exists, but is a general 
hypothesis that attempts to specify the neces- 
sary and sufficient conditions for all-or-none 
learning. Unfortunately for the experimenter, 
it is not known a priori exactly what must be 
done to remove all but one source of difficulty. 
Tf, in some experiment, all-or-none learning is 
not obtained, it would be Possible to respond 
that the problem must still involve more than 


one source of difficulty. This shifts the burden 
of proof to the proponent 


ing, who must give inde 
the existence of several sources of difficulty. 

The advantage of Version f is that it 
permits an account, within the all-or-none 
framework, of data that do not show the 
Statistical all-or-none Properties. If a given 
problem presents the subject with several 
difficulties, then its solution can be described 
by a multiple-stage all-or-none model. Such 
models encompass the complexities usually 
studied in the human-learning laboratory, yet 
have a manageable mathematical form, ‘The 
task ahead is to extend the precise logic of 
all-or-none models to the analysis of more 
complex and interesting tasks, 


of all-or-none learn- 
pendent evidence for 


‘The phrase all-or-none denotes the basic assumption 
of a Markov chain; therefore, these Proposals are not 
at all new, but now have relatively solid empirical 
support (see Miller, 1952), 
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applied. 


Spurious correlations and confounding vari- 
ables present a characteristic and recurrent 
problem to the social scientist. When samples 
of children drawn from several social classes 
(Variable X1) are found to differ on the level 
of their professional aspirations (X 2), one asks: 
Are the differences due, at least in part, to 
social class differences in intelligence (X3)? 
If a positive correlation is discovered between 
occupational aspirations (X. 1) and college 
grades (X), the possibility that differences in 
attitudes toward achievem 


ent (X3) mediate 
the relationship is immediately suggested. 


The techniques most commonly used to 
remove confounding effects of X3; on the re- 
lationship between X, and Xz are various 
matched-group designs, the analysis of co- 
variance, and partial correlation. Having used 
these techniques, the investigator is likely to 
feel safe: the confounding variable has been 
removed and need not be considered further 
in the interpretation of results, 

The purpose of the 


Present note is to draw 
attention to the effect: 


s of unreliability in the 


ases will be men- 
f known reliability 

1! My thanks are due to Est 
partment of Statistics, for he 
problem in its proper perspecti 
Kubovy read the manuscri 
helpful comments. 


© the prevention of a spurious association 


Xs. In all these techniques 
systematic bias of under- 
case of known reliability 
er than observed scores, 


è 

tions 2 
of the controlled measure. No a test- 
proposed to the difficulties of sign lity issue: 
ing which arise because of the relia ; 

is 0 

alysis 
Matched-Group Designs and the Andy 
Covariance 


ign Í$ 

The essence of the matched-grouP ve pot 
that cases are selected from two or hae A 
lations so that known differences am me vale 
Populations in the distribution of me grouP® 
able are ostensibly removed. Thus, pee u pa 
of children drawn respectively aa a uates 
and lower-class background may res, SUC 
on the basis of intelligence test po ed i ap 
a matching procedure may be folen c 
attempt to study the effect of so 
some dependent variable without (1942) jig 
tion by intelligence. Thorndike ( ten 
pointed out that such an approach riabh 
lead to Spurious results when th pan pi 
Which is the basis for matching i core s 
with less than perfect reliability- ee in w 
the fallacy is that groups matched |" coor? 
manner are similar in their observo restin, 
but differ in their true scores. ON ” 
the lower-class children will so 
the mean of the population from W® 
have been drawn and will tend wl re 
poorly. The middle-class children WÍ eit 
toward their Population mean, an 
Scores will indicate improvement. effect aay 

The magnitude of the regression © pil 
be calculated by Formula 1, if the ” 
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of . . 
the test within each population is known. 
T = p+ p(X—n), [1] 


where 7 5 
of ies X are, respectively, the means 
the reliabili observed scores in the group; p is 
observed ility coefficient, and y is the mean 
The b foe in the population. 
of X, i gel aie differences in true scores 
served or remain after matching on ob- 
association be make it likely that a spurious 
ë a X, and X: will be found. 
Such effect ing procedure undercorrects for 
groups be s. Thorndike (1942) proposed that 
Smee matched on the basis of predicted 
Y McNe » recently termed “regressed scores” 
Bupa sae (1962, p. 161). Formula: Limay 
he F this purpose. 
Pears ig ion suggested by Thorndike ap- 
Psycholog; ave been largely ignored in the 
atchin gical literature, and the procedure of 
Used, Th on observed scores is still commonly 
regression, fallacies due to unreliability and 
Complex effects are particularly severe ina 
Sign, the bes of the matched-groups de- 
th Vario actorial design where the levels on 
8toups = factors are formed by selection of 
fallacies n the basis of test scores. Regression 
ility wa. ccur when measures of low relia- 
stensib] ich are correlated in nature are made 
ulation orthogonal in the experimental 
Personality This design is quite popular in 
to a research and has often been ap- 
A hyp Casures of doubtful reliability. 
ee is tig example of this type of de- 
spat an eon in Table 1. It is assumed 
esi on n ee sets up a 2 X 2 factorial 
of Ures Pye basis of test scores on two 
ih, these , A and B. The test-retest reliabilities 
we. Measures are .49 and .64, respec- 
thlected 5 Correlation between them in an 
ye Mean Population is .30. The table shows 
slabe sher values to be expected on each 
s5 for mir the mean observed standard 
aR LS high and for the low groups are 
e 
z a uda presented in Table 1 have been 
hich by multiple-regression equations, in 


is o -£ observed values of A and B are 
§ les, „Predict the true scores on these vari- 
Xb scores are in standard units. The 


Sey. 

$i TI 

“on eo 0 and T refer to observed and true 
Spectively. The correlations used in 


TABLE 1 


MEAN TRUE SCORES OF GROUPS IN A FACTORIAL 
DESIGN FOUNDED ON OBSERVED SCORES 


Low, A, = —.50| High, A, = .50 


High, B. = .50 Ar = —.19 Ar = +.43 
Br = +.30 Br = 4.45 

Low, B = —.50 Ar = —43 Ar = +.19 
Br = —.45 Br = —.30 


the computation are: fAoBo = -30, aoar = -70 
BBT 7 80, ABr = 43, Tapp, = 38. The 
correlations between true and observed scores 
are obtained by means of the appropriate cor- 
rections for attenuation. It is apparent in 
Table 1 that the true scores on the variables 
are neither matched nor orthogonal. Spurious 
main effects and interactions are a very likely 
outcome when this design is used with corre- 
lated measures of low reliability. 

It is important to note that this type of 
flaw is actually fatal to the design. There is no 
correcting formula which will simultaneously 
achieve the required matching of values in 
the rows and columns. 

The analysis of covariance has been sug- 
gested as an alternative to the matching pro- 
cedure which presumably overcomes the re- 

y (McNemar, 1962, p. 373). The 


gression fallac; 
covariance design is generally used when there 


exists a substantial correlation between the 
dependent (X: 2) and controlled (X 3) variables. 
Two main cases may be distinguished (Ed- 
wards, 1950) depending on whether system- 
between-group differences exist on the 
controlled variable. In the absence of large 
between-group differences on Xs, unreliability 
in the measurement of this variable simply 
reduces the effectiveness of the analysis with- 
out producing any systematic bias. The case 
where large differences in Xz are found is the 
one where matching and analysis of covari- 
ance are reasonable alternatives. Under such 
conditions, it can be shown that unreliability 
in X; has similar effects on the two procedures. 

Consider the case where the null hypothesis 
is valid. When the reliability of X, decreases, 
the variance of this measure is inflated by 
error. Therefore, the regression of individual 
scores of X2 on the observed values of X3 


within groups is of lesser slope than the corre- 


atic 
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sponding regression on true values of X3. How- 
ever, when real differences exist among the 
means of X; in the different populations 
(classes of X1), unreliability of measurement 
has relatively less effect on the between-group 
variance of X: the variance of observed 
means is partly due to sampling error (which 
is affected by reliability), but it is also affected 
by population differences in X3, on which 
reliability has no effect. Consequently, the 
observed regression among group means will 
tend to be steeper than the within-group re- 
gression, which leads to a bias in the F test. 
It is interesting to note that increases in 
sample size have no systematic effect on the 
expected value of within-group estimates, 
while such increases consistently improve the 
approximation of between-group estimates to 
the true regression. Thus, in the case de- 
scribed, there is an inflated probability of 
Type I errors over stated significance levels 
which is due to insufficient control of X 3. The 
probability of such errors increases as the re- 
liability of X; decreases; it also increases with 
sample size. 

The conclusion must be that the analysis of 
covariance 1s suspect of undercorrection when- 
ever prior analysis suggests that real differ- 
ences among groups exist on t 


he X; variable 
unless this variable is identified wi th ATS, 
reliability. vith very high 


Partial Correlation 


e measurement of X, 


a systematic an i 
ae d predictable 


The partial correlation 
values of X, and X,—with 
controlled—is given in Eq 


between observed 


true scores in X 3 
uation 2:2 


Tio — RaRa 
VASRA RA © 


2 The symbol R is used for coe 
values of X3 are considered. 


Riz. = 


fficients in which true 
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When X3 is measured with reliability P, coeff- 
cients which involve this variable are ae 
ated. The partial correlation, with observ 
scores of X; controlled, now becomes: 


rio — pRisRos 


. [5 
12.3 = Va — PRANA — pR?) 


r 


ie i seen 
When reliability is perfect, it can be sé 


a 
that riv.s is equal to Rio. With zero 
bility, riz is equal to zi, that is, the meatg, 
of direct association which the use of P’ 
correlation was intended to correct. g 

With intermediate values of reliabilly, 1. 
observed partial generally takes & value | 
tween the true partial and the dire ar 
tion between X; and X». Where the true ae 
tial is higher than the direct correlation 5 
underestimates the relationship. Perhaps sta : 
important, when the true partial 15 al sy 
tially lower than ry, the observed aa use 
tematically overestimates the degre? a Xe 
confounded association between Hye res 
In either case, the use of observed sc? gull 
Xs in the partial coefficient yields * “phe 
which is biased in the direction 0! th 
covariance between X; and X2 which 
to X; is not effectively partialled out. e! 

The magnitude of this bias is ilusti ving 
the following example. Consider the 10 
correlations, in which true scores ° 50. 
assumed: 712 = .30, Ris = .60, Res Z < meas 
true partial correlation is 0. When pa stlowing 
ured with a reliability of .64, the m = i 
observed correlations are expected: “jal cor” 
713 = 48, roy = 40. The observed P™ 
relation is .135. es ° j 

In order to obtain unbiased estimat, £ a 
partial correlation in practice, it 15 P ue : 
to correct rı; and ra for attenuatio? cas? A 
the unreliability of X3. Unlike the atten 
simple correlation, the correction for he 
tion does not necessarily yield & m 
mate of the partial correlation. I 
common case, where the relationsh!P ow 


e 


è This statement is not always true- | 
restricted conditions, the observed rA inc af 
vary monotonically when the reliability © A imil gh 

e conditions for this effect are closeY 
the conditions under which Riz. and 712 9 eties, 
In such Cases, partial correlation is of nO P imi 
vance. Its value is then affected to 2 Vê 
tent by the reliability of X3. 
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Xı and X; is suspected of being spurious, cor- REFERENCES 


recti s . y « 

ae for attenuation will yield a lower value Epwarps, A. L. Experimental design in psychological 
r the partial correlation. In general, high research. New York: Rinehart, 1950. 

values of the observed partial will be found McNestar, Q. Psychological statistics (2nd ed.). New 


ag ae than lower values. However, when aoe ue ig PON < 

© obser areal i : snih- HORNDIKE, R. L. Regression fallacies in the matche 

cance ven partial is ‘Of borderline signifi groups experiment. Psychometrika, 1942, 7, 85-102. 
» Considerations of reliability may have 


an important effect on research conclusions. (Received August 16, 1964) 
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ON THE PARTITIONING OF CONTINGENCY TABLES’ 
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The chi-square test is presented in general. If a contingency table is greater than 
2 X 2 the usual chi-square test of independence fails to determine the source of 


interaction if interaction is 
of r X c contingency table: 
tioning an r X c contingenc 
each with 1 degree of freedo 
pendent components is also 
ference about independence 
hypotheses to be tested wi 


Most psychologists are acquainted with the 
use of the chi-square test to test a null hy- 
pothesis of independence of Proportions in a 
fourfold contingency table and as 
goodness-of-fit test. Behavioral sci 
tistics texts generally give the chi- 
little attention beyond these two a 
However, the chi-square test has 
because of its apparent simplicity, for it is a 
test which requires only the “counting” of 
observations and the subsequent Placing of 
observations into mutually exclusive and ex- 
haustive categories. 

Much more sophisticated use 
Square test can be made tha: 


a simple 
ence sta- 
square test 
pplications. 
much appeal 


of the chi- 
n is indicated in 


In the sequel it is assume 
1S acquainted with the use of the chi-square 
1 Preparation of this Paper was sı i 
the Tri-Ethnic Resear p ae T ae g 


c ch Project, Grant Ni . 3M- 
from the National Institute of Mental Beal ae 


Institute of Behavioral Science University of 
y of Col 
0. C. Stewart and R. Jessor, Co-Direttos, This pane? 
comprises publication No. 45 of the Institute. The 
author would like to thank D. E, Bailey and § WwW 
Link for their helpful comments and cogent criticisms. 
? Now at Indiana University, ` 


d that the reader 


present. Methods for extending the chi-square analysis 
s are presented. The procedures outlined consist of parti- 

y table into (r — 1) (c — 1) independent components, 
m. A special partition of a 2 X c table into 3 inde- 
given. The results of partitioning permit precise in- 
or interaction in a contingency table and allow specific 
thout loss of generality. 


put refer 


test. No derivations are presented, fs wil 


h > ; i o 
ence to discussions of important pre 
made. 


CHI-SQUARE TEST 


son O 

on + pution 
If we have a bivariate (joint) distrib sets of 
two discrete random variables requ” 
categories F and Z, then a bivariate 1% ps on 


. * . ation 
distribution of a sample of N observ jes %1? 


(Y, Z) may be constructed. Such ta con 
usually called contingency tables an pave f 
venience we will consider the table a 
rows and c columns, that is, the oe! ane, 
values or categories on Y and Z at roquen F 
respectively. If we record observed 1! 6 ain ? 
in the cells of such a table, then "y eguen) 
table (see Table 1) where ai; is the T su 
of occurrence of the pair (Ya, Z). We T opte 
the frequencies in rows and columns &™ e f A 
the marginal frequencies, that 1$, taken ip 
quencies of the values of Y and Z ¥ 
dividually rather than jointly: 


a; = È tij j= Laoi 


vie 
Now, the contingency table may T dis, 
aS a set of observations—that is, & 
cells of the table correspond to z c 
servations, Suppose we wish to CO peor’ iy 
observed frequencies with some 
frequencies, Npi;, where pis is the P 
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U ofa 3 
r ity eee (Y; Z;), that is, the proba- 
ill in the pote an observation which would 
and N ig + h cell of the contingency table, 
‘ation : e olal sample size as defined by 
quencies = sg: As with the observed fre- 
frand i ee A marginal probabilities 


WEF g 

È bis i=1,2,--,7, and [4] 
bye i 
j È bus FHLAerrye [5] 


Then 


Probability: can be shown, regardless of the 
becomes 17, distribution of (Y; Z;), that as N 
arge (V — œ), the quantity 


i ga 2 (aij — Npa) /N pis [6] 
Is distri ý 

free, oe as x? with rc —1 degrees of 
oo ). Fo his test was introduced by Pearson 
X . The aR see Crámer (1946, ch. 
(viora Era ntities Mp; are known in the be- 
. o Dcies.” T nce literature as “theoretical fre- 
D the obg n this form they are not dependent 
y ervations except in the sense that 


y Py = N. Th: 
N. This one restriction leads to the 


of 1 : 

We ae of freedom; hence df = re — 1. 
Sinals ag ider only the row and column 
i «s Univariate distributions, then by 
{uation 6 we find 
r 
Po Sy 
my Ci — WD;.)?/N i. for rows, [7] 


nq 


tee $ 
im (@;~N p.3)?/Np.; for columns. [8] 


PVE a 
and XZ are distributed as x” 
and (c — 1) degrees of freedom, 


the ™) if ag 

pite is : “4 a null hypothesis we assume that 

T the association (i.e., independence) 

ilig q Occur pier that an observation 
Y that i n a given column and the proba- 

ti, he ae n observation should occur in a 

ttih Sti 


ley, tti 


Neti : ., 
tng tion j tion between the quantity X? and its 


is A 

WA texts, Wee clearly made in most behavioral 
N Se of enoti e will observe the more correct pro- 
Patately S the quantity X? and its x? distribu- 


TABLE 1 
GENERAL CONTINGENCY TABLE 
on tiz iii Mj aaki Ue ay 
an asa, áa 5 Fai LEN a2, 
a h. 
an aro seis arj i are ar, 
ay as sea tj ia Qe N 


given row, then pi;= pip. for each cell. 
Hence, we may rewrite Equation 6 under the 
assumption of independence as 


X= pape (ai; — Npip)/Np:p. [9] 
which is distributed as x? with rc — 1 degree: 
of freedom. Further, under the iypoihess of 
independence and given the observed mar- 
ginals only, the maximum likelihood estimate 
of the expected observed frequency in cell 7 is 


[10] 


ĉj = @;.a.;/N. 
Then we may write 


r (ai; — a4.0.;/NP? 
XZ= BS: Meili he 
peL 2 aN 


i=l jel 
(for interaction) 


roe 


=2 


i=l j=l ĉij 


[it] 


(aij — eu)? 


r c q 
=EL ——-N. 

i=1 j=l Ĉij 
with (r — 1)(c — 1) degrees of freedom. This 
component measures the interaction (or as- 
sociation) between the observations. It can be 
shown (Lancaster, 1949) that the X”s given 
by Equations 7, 8, and 11 sum to yield the 
X? given by Equation 9. That is, 


yea X24 X24 Xe. 


Similarly, the degrees of freedom sum to 
re — 1. That is, 

eo 1e r=) (-—)+¢— 1)(e— 1). 
and X;2, it can be seen that the 
is of contingency tables is an 
vial case of analysis of 


Using XA, Xe; 
chi-square analys 
approximation of a tri 


TABLE 2 


GENERAL CHI-SQUARE TABLE 


x? 
Source af 
5 i, N, «P/N pi. 
Rows r—1 A (ai. b i 
$ 7. 2N. 
Columns c—1 E (a.;—Np.5)*/N ps 


inl 


3 (a;;—a;,0,;/N)? 


Interaction | (r—1) (¢—1) @03/N 


i=l jal 


r e 


E E (aj Npip aV /N pba 


BY aoe 


Total ro—1 


variance. These results are summarized in 
Table 2. 


Thus far it has been assumed that the mar- 
ginals of the observed distribution may differ 
from the theoretical proportions, that is, that 
ai, and a; might not or do not have to be equal 
to Np:. and Np ;, respectively. However, it is 
often the case that the p;. and p; are not 
known. When this is the case one uses a;, and 
a; as estimates of Np;, and Np .;, respectively. 
If such an assumption is made, then X, and 
Xè will be zero. That is, if the row and column 
marginals are not assumed to be estimates of 
the population parameters but rather the 
population parameters themselves, then the 
row and column sums of squares will be 


identically zero and all that remains is the 
row and column interaction sum of squares 
which is X? as usual 


ly calculated for a con- 
tingency table. We now drop the subscript k 
from subsequent formulae since all X25 will 
be interaction terms, 


As already mentioned, the chi 
assesses the discrepancy between a set of 
observed frequencies and a set of theoretical 
frequencies. The larger the discrepancy, the 
larger the value of chi-s 


quare. Intuitively, the 
larger the discrepancy, the less likely it is that 
the theoretical distribution could give rise to 


the obtained distribution. If there are 
degrees of freedom, then the x? distribution 
has the expected value, E[y2] = n, and 
variance Var[x*] = 2n. Hence, given a value 


“Square test 
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seas to this 
of X?, one may attach a probabil a proba- 
obtained value. This probability tions C 
bility that the theoretical pa as large 
give rise by chance to; discrepan ancy: This 
oF larger than the obtained gen P nagnitude 
probability is a function of both t a associat 
of X? and the degrees of freedom 

ith it. i 

E the foregoing iphrodiin agre 
specific applications of the : 
Several cases will be considered. 


0 
n we tu r 


E 
DEGRE: 
Contincency TABLES WITH 1 

OF FREEDOM singen g 

nung; 

Perhaps the most common at behav, 
table is the 2 X 2 table. As races é se 
science statistics texts have IN bå omit a 
tables, detailed rage Se at erre jat 
here. The interested reader 


A 1 he 
McNemar (1962) and Musee consider dh | 
excellent discussions. pola somos OF pd 
variables Y and Z to be dicho 8 


11 may be simplified in this 2 X 
N (anas— 412021) PET] j 

= (anutan) (antan) (anta) in f 
el 


z aD 
i in the followins Me 
or if we label the table in the 


i rewritten o a 
then Equation 12 may be re 


2 N(AD—BC/Y D 
. (A+B)(A+0) (C+D) (BT 


jvatio 

As was noted earlier, the ae ase a 
proof that X? is distributed as X exP 

the assumption that the ei’) , 4 
Tequencies in all cells, tend tO 
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(19 p à Son” 
(1934) has given a “correction for continuity” 


aN Soda be applied when the e;;’s are 

x9 : ‘a correction should be used for all 

ies es. Using this correction, Equation 
ay be rewritten 


xea — N(|AD—BC|—N/2? 
(A+B) (A+C)(C+D)(B+D) ' 


Even w 


[12b] 


not Era this correction, the values of X? do 
isctepan le exact probabilities associated with 
he ar Hence, an even better solution 

y eal expected value problem is met 
dined ae the exact probability of the 
We of Fi ri ag This can be done by 
Kae ear Exact Test which gives the 
Values, 4 a of occurrence of a set of 
tequencies B, C, and D, given fixed marginal 
en in s. Techniques for this procedure are 
; Most behavioral science statistics 
Well ( example, McNemar (1962) and 

(1961). 


Coy 

NTING rx, $ 

INGENCY TABLES WITH MORE THAN 
1 DEGREE OF FREEDOM 


i chi-square test has been 
èd with respect to testing hypotheses 
2x independence of proportions. In the 
Dro a inference about independence of 
“hen cee Was straightforward. However, 
me ns. table has more than two rows or two 
Vide the overall chi-square test can only 
able p Mformation that somewhere in the 
the = observed proportions do not match 
ateher, teal proportions. For most re- 
ses od this is not enough. The question 
ei a just where in a contingency table 
M4 the rn discrepancies occur. If we return 
af eet q alysis of variance analogy, We might 
itive êt it is possible to divide X? into 
i th orp Pendent components analogous 
fe Analysis gonal comparisons or contrasts 1n 
tog th Wit is of variance. This can be done 
Sug 1949; Lancaster, 1949, 1950.) The 
jing i; rovide a very useful method of ana- 
“tee i ae tables with more than 1 
Mde Sac Kimball (1954) has derived 
en i Mputational formulae similar to those 
bgp? w remainder of this paper. 
Cott ae X? into independent com- 
ti Sher necessary to subdivide the original 
Yy table in a special manner giving 


i 
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rise to such independent components. If 
procedures such as those outlined below are 
not used, then independence of the com- 
ponents is not assured and interpretation is 
difficult, if not impossible. In the discussion 
section, further reasons for partitioning will 
be given. While the partitioning procedures are 
fixed and standard, it is to be remembered 
that the ordering of rows and columns ina 
contingency table is arbitrary. Thus it is neces- 
sary for the researcher to order a table so that 
the resulting partitions are meaningful. 

Before giving general partitioning pro- 
cedures, a few examples should be helpful. 
Consider a 2 X 3 table. Such a table can be 
partitioned into two fourfold tables in the 
following manner, the numbers under each 
partition being a number associated with that 


partition. 


au a2 an + t ais 
——— Su 
a21 a azı + a22 | a23 
(1) (2) 


that all frequencies occurring in a cell 
of a partition are to be summed. For instance, 
in Partition 2 above, the frequency in the 
hand cell is a1 + 412, the frequency 


upper left- 
in the lower left is doy + G22, etc. For a 3x*3 


table the partitions are 


Note 


au az an + a tis 
ee a 
a21 az azı + O22 23 
(1) (2) 
au a12 an + t 13 
+ | + + + 
an ax an + a22 azs 
pan an 
az 32 ası + ase a33 
(3) (4) 


The general contingency table (Table 1) 
can be partitioned into (r — We — 1) ae 
fold tables with 1 degree of freedom each. 
The procedure for generating the partitions 1$ 
quite simple. For such a table, a partition Is 
formed by taking the last 7 — 1 row andc — 1 
column frequencies as the lower right member 
of the (7 — 1)(c — 1) independent fourfold 
tables. The remaining three cell frequencies 
for each fourfold table are obtained as follows: 
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The lower left cell contains the row sum to the 
left of the lower right frequency, the upper 
right cell contains the column sum above the 
lower right frequency, the upper left cell 
frequency is the sum of all frequencies above 
and to the left of the lower right frequency. 
That is, the partition associated with cell a;; 
can be represented by the following fourfold 
table: 


anut Gigs + anj tij 
aa + t + + Mj 
Gj-1,1 t taij- tii,j 
ai kaida r aij 


This procedure ensures that independent sub- 
divisions of the original contingency table 
have been made and facilitates computation 
of the component X°s, 


2 X c Tables 
Formulae. For the 2X 3 table above, the 
X?s associated with the two partitions are 
jia N? (a11422— aran)? 
a @1,49.4,14.2(a.1+4 ») » ond [13] 
X= NEaslantan)—an(an+an)] 
A mme r T 2) P 
1.02.4 3(a 1a.) » M] 


the subscript ¢ on the X2 
partition, 


In general, for a 2 X c tabl 
X?s can be computed from 


« referring to the ith 


e, the partitioned 


i j 
N [a —_ : 
Pies Laz, 2 Og = Misr = az 


i=l 


t 1 » U5 
Aata aS aj] : P 


where ¢ = 1,2,..., (e — 1), 


Each of 
can be summed to yield the ee 


Overall X?, that is, 


72 S 
P= D 


t=1 


Example of a 2 X c Table. Suppose we h 
data on some variable 


Let F be high or low. 
following table: 


: ave 
Y for four ethnic groups. 


Then we can make the 
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TABLE 3 
EXAMPLE OF 2 X c TABLE 


Totals 
; Rural Local isl if 
Variables | | nda Indian Spanish | Anglo 
66 
High y | 21 8 24 13 59 
Low Y 14 3 12 30 
125 
Totals} 35 11 36 43 
e using | 


If we calculate X? for the entire tabl with | 
Equation 11, we obtain X? = 14.03, a disti 
3 degrees of freedom, reference to 2X hobr 
bution table indicates a significance Pll 
bility of p < .005. This large X? an4 are i 
significance probability indicate that nd tH 
interaction between ethnic status, rep? 
variable F. That is, there is a large a n 
between the observed frequencies tne 
theoretical frequencies as determine me p 
marginal frequencies. However, it ™ ce 
that the discrepancy is large for O isc" 


f 
and small for others. In order rsoropnl 


where in the table the major dise edule 
oning pr? 5 


5 


ncy 


arise, we resort to the partiti re # 
described for 2X c tables. As the in 

degrees of freedom we may obta! w c0" 
partitions. These three partitions gue 
parisons to be made as follows: oF 
Indian-Local Indian (RT X LT) co™ 


pariso™ 


xe, — 125%(3-21 — 8-14} _ 0.5% 

y= see 

wh "66-59-11 +3546 s) 
LX 

and for the Indian-Spanish [RI 7 

comparison, 


x2, _ 12512-29 — 24-11} 0.30 


66-59.36-46:82 xh) 


LI- 5 
and for the nonAnglo-Anglo [(RI— 
comparison, 


9 9. 

ye, — 125?(30-53 — 13-29%} L 133 
A St ee 

66-59-43-82-125 ent” 

y re 

T hese partitioned X?s may now beh j 

in an “X? Table.” js D0 vost 

This analysis indicates that there ™ je us 


. a w yi 
tion within the nonAnglo groups “pel 


is high (and significant) variatio” 
nonAnglos and Anglos. 
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TABLE 4 TABLE 5 
X? TABLE FOR 2 X ¢ EXAMPLE EXAMPLE OF y X c TABLE 
Sour é 
ce df X: p Attitude score 

4 RIXLI i s4 Group 
a Ri-LDxs 1 a | ae = i N BE 

LI—S)XA 1 13.39 p<.001 Indian 25 17 20 62 

Total 7 Spanish 27 y 5 5 
al 3 14.03 p<.005 Anglo 13 33 2 A 
Totals 65 57 90 212 

TX c Tables 


Fo 
3 oe For the 3 X 3 table given above 
S Associated with each partition are 


NÇas (asa — NRR) 
— a (auan — 4.1422) P 


a102. 14.2 (01, +02.) (a 140.2) 
KS MEn (antai) — a (aHa) F 
41,42,4,3(a1,4+-a@2,) (a 144.2) 


Xe Mlan (antan) — as (a+ ao) F 
%3,4 1a 2(@1, +a.) (a 140.2) 


NLass (antant antan) 
= (antan) (ası+ 432) 


43.0.3(a1,-+ a2) (1+ 0.2) 


Pa 


, [16] 


» 17] 


, [18] 


ya 
[19] 


Th 
8eneral, for an y X c table the partitioned 


a . 
sociated with aij is given by 
7 il jo il jl 
Nasla; E Zan Lan. L aie) 
h=1 k=1 hal kel 
j-1 i-1 i=l " 
Xs = ap (ai. D anj tij bis an) F 
k=l h=l h=1 
a j-1 j i—1 i 
LAOD ax)( aa ( È an) (È an.) 
Wh k=l k=l h=l h=l 
rey — 
rang Z EDG- Dti hie De 
bee 2,-++,¢. It should be noted that 


~ 8eherg 4 
‘nj Tal formulae are not as formidable as 


ht fret 
Dartit F appear. After computing only a few 
Metho ed X%s, one can readily apply the 


ey, tam oe recourse to the formulae. 
inig ble. Returning to the study of our 
thnj, Sfoups, suppose that we have three 
wre ous, and suppose further that we 
W ae on a trichotomous variable 
€ ays cesures an attitude Y. Responses to 
itude measure can be negative (-), 


neutral (0), or positive (+). The data are 
presented in Table 5. 

If we calculate X? for the entire table using 
Equation 11, we obtain X? = 24.75, and with 
4 degrees of freedom, p < .0001. This large 
and significant X? indicates that there is inter- 
action between ethnic status and the measured 
attitude F. To study this interaction in greater 
detail we partition X’. 

The partitioned Xs are as follows: (1) For 
the Indian versus Spanish comparison on 
negative versus neutral attitude, 


212[59(57-25— 65-17) 
—62(57-27—65- 7P 


62-59-65-57-121-122 =247, 


X= 


(2) for the Indian versus Spanish comparison 
on negative and neutral attitude versus posi- 


tive attitude, 
212(25-42 — 20-34)? 


s2-59-90-121-122 7 2h 


X% = 
(3) for the nonAnglo versus Anglo comparison 
on negative versus neutral attitude, 
-52 — 13-24)* 
2122(33-52 — 13-24)? _ 1781, 
01-65+57-121-122 


for the nonAnglo versus Anglo 
tive and neutral attitude 


= 


(4) finally, 
comparison in nega 
versus positive attitude, 
212(45-76 — 45-46)? 
91-90-121-122 


= 3.20. 


X% = 


These statistics may be summarized in a X? 


Table. : 
These data suggest that while there are 


overall ethnic group differences on the attitude 
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TABLE 6 
X? TABLE FOR THE r X ¢ EXAMPLE 


Source af X: b 
2.47 |ns 
1) IXS on (— vs. 0) 1 
m IXS on (—, 0 vs. +) i 1.27 |ns 
(3) (I—S)XA on (— vs.0) | 1 | 17.81 |p<.o001 
(4) (I—S)XA on (—,0vs.+)| 1 3.20 | p<.10 
Total 4 | 24.75 | p<.0001 


Y, there are no differences between the Indian 
and Spanish groups. Consequently, the high 
X? is primarily due to differences in the Anglo 
and nonAnglo cells. Further, most of these 
differences are due to negative and neutral 
attitudes with only slight differences among 
Anglos and nonAnglos due to the comparison 
of negative-neutral attitudes and positive 
attitude. Note that exchange of rows and 
columns may be necessary to arrange the 


table to test other specific comparisons with 
these procedures. 


A Special Partition of 2 X c Tables 


Procedure. There is yet another 
method of partitioning 
of interest to the beh 
method is due to Br 
somewh 


useful 
a 2X c table which is 
avioral scientist. This 
andt and Snedecor and is 
at more familiar. Suppose there are ¢ 
groups (or ¢ values of some variable) for which 
some dichotomous variable is measured (e.g, 
the presence or absence of some trait) for the 
several groups. The data could be represented 
as in Table 7. 


Then X? for this table can be given by 
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Equation 21 is equivalent to Equation oe 

Suppose we believe that for the “cade 
there is some a priori basis for ges the 
that the first cı groups are different a thir 
subsequent cz groups on the Basis i“ where 
variable, say sex, age, education, : arbitrary’ 
the groups may be ordered in any pi for 
manner. It would be desirable ie roups: 
differences between these two sets a the 
One way to do this would be to co 


test 
f s an 
two sets of groups into two group old table: 


that f 
with c— 1 degrees of freedom. Note 


partitioning we can assess the pe 
within and between the two poima sive 
the following manner. The nt into 
Equation 21 can be partitione! 
additive components: 


Ba meih fc B" e 

X? due to variation within the set con gr P 

X? due to the variation between the pes? 
ap th 

d weil 


The degrees of freedom associate respec 
X’s are ¢,— 1, c — 1, and 1, iain the 
These three X?s can be added to obti 

calculated for the entire table. 


a > 1 
The procedure is to alter Table 
tables: 


into w 


TABLE 8 


JAL 
p SPECI 
GENERAL REPRESENTATION OF 


Two Sers or GROUPS 


ss S j 
X = ma x a, a BN 
TABLE 7 ûn üa diey as" 
GENERAL REPRESENTATION OF SPECIAL Absent HAL SR, SA aH Na 
Partition or 2 X c TABLE Totals ia ie Hee di ee 
Group ~ i gota? 
Trait — ~] Totas Trait 2 SON 
E wes @ Ba me G 
Present Gu aie Qie a Present arap ttt Ate 
Absent an dae Qe a2 Absent üreyi tt Ate Ne 
Totals Gi Ge ms ays N 


Totals Greet ott de 


pes p FOR 
PARTITION or 2 X c TABLE 

ES] i 

ota 
Cı groups 

Trait eee EE 2 
1 E au a (n 

= Ma 

Present tes 


PARTITIONING OF CONTINGENCY TABLES 


Note that = Na + Nes, 


ie + i.e = 1, etc. 


Then the computational formulae for the 
Partitions become: 


Aaa [. xy = T] [23] 


7: 72 2 2 2 
anea Ne [ Sia + Qc a [24] 


aas L Na © Na N 


a pe Suppose we have the following 

irae Some trait Y for three ethnic groups. 

sex dime. we wish to know if there are any 

Btoup P RR when we also expect ethnic 

in Tabl ifferences, We may represent the data 
e 9, 


tx TABLE 9 
“AMPLE OF SPECIAL PARTITION OF 2X c TABLE 
Trait Y Groups Totals 
Te AS SP IF AF S? IF 
ier 6 11 13 6 15 16 67 
17 4 11 14 6 6 58 
T . 
tals | 23 15 24 20 21 22 125 
Then 


> Using Formulae 21 through 24, we have 
Xa. 195 , 
65-59 [125-40.824 — 67] = 19.79 


Or 
the Overall X, and 


x, _ 62 
n = 35 gp [62-16.674 — 30°] = 8.60; 
X 63 . 
1> yzg [6324151 — 37] = 9.95; 
Caga 125 F30, 3P 6P ] = 1.31 
67-58 | 62 t 63 135] 


ORES a i 

ig i m indicates the male group and f in- 

Um, ` the female group. These results may be 
“tized in an X? Table. 
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TABLE 10 


X? TABLE FOR EXAMPLE OF SPECIAL 
PARTITION OF 2 X c TABLE 


Source df X: ? 
Within males va 8.60 p<.02 
Within females 2 9.95 p<. 
Males X Females} 1 1.31 ns 

Total 5 19.79 p<.005 


These results allow us to infer that there are 
ethnic group differences on trait Y for both 
males and females, but that there are no 
overall sex differences on trait Y—hence it 
is ethnicity, not sex, which leads to overall 
significance in the above 2 X 6 contingency 
table. 


Small Expected Values in the r X c Table 


As was mentioned for the 2 X 2 table certain 
corrections for the X? value in such a table are 
warranted on the basis of small expected fre- 
quencies. When a table is not 2 X 2 it would 
seem that some correction would be called for 
when small expected frequencies are involved. 
Usually the procedure has been to collapse 
(pool) categories with small expected fre- 
quencies. Cochran (1954) has shown that such 
a procedure seems much too conservative and 
advises against the traditional pooling criteria, 
as does Maxwell (1961). For contingency 
tables with small NV or few rows and few 
columns the researcher can resort to exact 
probability test procedures given by Freeman 
and Halton (1951) and Bennett and Nakamura 
(1963). For further discussion see these four 
references. ue 

With respect to partitioning procedures, 
pooling of small expected frequencies and cor- 
rections for continuity should not be done. 


DISCUSSION 


At this point, there may still be questions 
as to why the researcher should resort to the 
rather complicated partitioning procedures 
when it would be easier to set up several 2 X 2 
tables and calculate the X? for each one using 
Equation 12b. There are two objections to 


doing this. 
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First, recall that in setting up the contin- 
gency tables it was assumed that the marginal 
observed frequencies were the population 
values. This being so, the marginals of any 
arbitrary 2 X 2 table extracted from a larger 
table may not reflect the true marginals of the 
larger table, since any 2 X 2 table on which 
X? is computed from Equation 12a or 12b 
assumes that the population is bifurcated in 
the proportions indicated by the marginal 
values of that table. That is, any selected 2 X 2 
tables would reflect the nature of as many 
populations as tables. Recall that the cate- 
gories must be both mutually exclusive and 
exhaustive of the population on which infer- 
ence is desired. The method of partitioning the 
degrees of freedom allows generalization back 
to the entire population and not just to the 
population contained in the partition. Thus in 
the example given first, inference about the 
ethnic groups could be related back to a single 
population containing all of the ethnic groups. 
Hence we were able to reject an hypothesis 
that the two Indian groups were dissimilar 
in a population containing all of the ethnic 
groups. 

Second, as discussed earlier, the X?s for 
such arbitrary 2 X 2 tables would be corre- 
lated, and appropriate inferences on a set of 
such Xs would prove to be difficult. However. 
partitioned X°s are not correlated and can be 


interpreted independently, thus allowin; 
ingful inference, ee 


determined before calcula 


titioned X°s, For instance, in the first example 
it was hypothesized that t 
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similarity among nonAnglos and that ar: 
Anglo groups should differ from the F s 
group. Consequently, the table was lai Ph 
so that the Rural Indians and Local In Sn 
could be compared and similarity ani 
Indian groups assessed. It was then pos an 
to assess differences between Spanish non 
Indians. Lastly, the differences between ‘hi 
Anglos and Anglos were evaluated. ae 
manner the relevant hypotheses were tes 
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DESIGN OF EXPERIMENTS TO MAXIMIZE POWER 
RELATIVE TO COST: 


JOHN E. OVERALL anp SUDHIR N. DALAL 
University of Texas Medical Branch 


The relationship of power of F tests to expected mean squares, E(MS), in the 
analysis of variance is discussed. While components of variance in the E(MS) are 
largely a function of nature, the coefficients associated with them are matters of 
experimental design. Frequently a different cost is associated with each type of 
experimental unit represented by the different coefficients. It is possible to maximize 
power relative to cost by optimal allocation of available resources among various 
types of experimental units—for example, numbers of Ss, duplicate measures, 
replicates, etc. A simple index of relative power involving the ratio of the estimated 
F ratio to Fa is proposed as useful in choosing the allocation of resources most likely 


to yield significant results. 


tar mani contribution by the statistically 
experi ro owist should be in the design of 
allocat ents. In particular, it is important to 
€ resources in such a way that there is 
Sow imum probability of detecting true 
eration differences when they exist. Con- 
and a of expected mean squares, E(MS), 
Such a paa of components of variance makes 
Practicalities choice of experimental design a 
syeh vA in many situations. 

ES) ta are familiar with the use of 
the ana} or selection of proper error terms in 
of q 2/ysis of variance (Gaito, 1961). Methods 
various 8 E(MS) have been presented in 
Winer i exts (Federer, 1955; Lindquist, 1953; 
EMS) 962), and, in fact, complete tables of 
are pr Values for most experimental designs 
Reinera) sed in these and other sources. Tn 
in the d a proper F ratio is one which contains 
Contain €Nominator all components of variance 
associ Sq AR, the numerator except the one 
ated with the parameter being tested. 
mant experimental situations, the E 
able 4, menter) has various alternatives avail- 
testing him in efforts to maximize power in 
Size, i hypotheses, He can increase sample 
Secure Š more reliable measurement techniques, 
lect) multiple measurement on each S (sub- 
dum Stratify samples into larger or smaller 
More > of levels according to values on one or 
of Sthe nite variables, or do one of a number 
telatiy, T things which will tend to reduce error 
an © to treatment components. Unless he 
tail research was accomplished using a computer 
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has unlimited resources, he must weigh the 
probable value of these alternative methods 
against their costs. If his resources are quite 
limited, he may wish to maximize the proba- 
bility of obtaining a significant F ratio relative 
to the cost of the experiment. 

The purpose of this article is to discuss the 
usefulness of E(MS) and components of 
variance estimates in designing experiments to 
have maximum power relative to cost. The 
probability that an F ratio will result in 
rejection of the null hypothesis (when it is 
really false) depends upon the alpha level, 
degrees of freedom, and upon the components 
of variance and their associated coefficients in 
the (MS). While the components of vari- 
ance representing various error factors may 
be largely a function of nature, the magnitudes 
of coefficients associated with them are ex- 
perimental parameters under the control of Æ. 
These coefficients represent the number of Ss, 
number of levels of a control variable, and so 
on. If estimates of components of variance are 
available, Æ is free to design the experiment 
so as to maximize the probability of a signifi- 
cant F ratio within the limitation of resources 
available to him. , 3 

E(MS) and F Ratios. The F ratio used in 
testing a statistical hypothesis contains in its 
denominator all components of variance in- 
cluded in the numerator except the one related 
to the H being tested.? If the component of 
variance due to treatment effect being tested 

2 This use of the term “component of variance” in 


onnection with fixed treatment effects is slightly 


paradoxical; however, the meaning is clear and the 


variance of the fixed treatment is of = EBè/k. 
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A 
Fic. 1. Heuristic diagram of central and 
noncentral F distributions. 


is actually equal to zero, and the numerator 
and denominator are independently distributed 
as chi-squares, then the sampling distribution 
of the ratio will be a “central F distribution.” 
It is frequently said that the central F dis- 
tribution is the sampling distribution of the 
ratio of two independent estimates of the 
same population variance, and several writers 
have shown that the central F distribution 
provides an adequate approximation, even if 
numerator and denominator do not have chi- 
square distributions, so long as both are inde- 
pendent estimates of the same population 
variance (Satterthwaite, 1941), On the other 
hand, if the numerator of the ratio contains an 
additional nonzero treatment term, the numer- 
ator and denominator will not be estimates of 


the same population variance and the statistic 
will have a “noncentral F distribution.” 


2 2 
oe + moi 
Fea A 


oè ? 


[1] 


where o is the total variance common to both 


the numerator and denominator, and mo? is 
the additional term present in the numerator 
but not the denominator, 


In the general case, the denominator and 
numerator may contain several common terms, 
that is, the error variance may be composed of 
several different components. Let o2 be the 
general representation for all components of 
variance appearing in both the numerator and 
denominator of the proper F ratio. In a similar 
manner, let moë represent the addition treat- 
ment component where m stands for magnitude 
and is a general notation for all coefficients 
associated with that component of variance, 

As one specific example, consider the F ratio 
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i ct 
igni ain effe 
for testing the significance of the p 7 
for A in a two-way mixed-model ana 
variance: 


aê t noat t+ bien’ p2] 


2 
oè + noan” 


F 


3 is 
riance 
In this example, the total error va 
composed of two components: 


2 
of = oè + noas- 


A cients 

The treatment component oq” has ee 
bn so that bn = m of Equation 1. In @ =e 
manner, the variance in any F mn? atment 
conceived in terms of error and te? 
components as in Equation 1. sie diagra® 

In Figure 1 is presented a peurishe mi 5 
showing theoretical sampling distribu marke 
under two conditions. The distribution | „cludes 
F is the central F distribution which Fi jndee! 
the range of values one would ere two 
the statistic computed was the rati 


ation 
r p opula d 

independent estimates of the sae OP sen 8 
variance—if mo2 = 0. A value Fa! if in 


being the largest value likely to ae 
deed the statistic computed is the ra varjant 
independent estimates of the An han # 
If an F ratio having a value larger atio PE 
obtained, it is concluded that the A distrib” 
ably did not belong to the central J’ jg the 
tion. The distribution marked a i" 
theoretical sampling distribution D 4 ser 
in which the numerator contains on in z 
treatment component maë not pres% th 
denominator. The distribution 3 
sampling distribution which the hypo” y 
would actually have if the null PYF trea 
were false and if the magnitude of r 
ment differences were mo. ‘tical value 
In Figure 1, the value Fa is the cri en ord 
of F beyond which only alpha ‘ena wou 
-05 or .01) of the total range of V hypot ie 
be expected to fall if indeed the null BYE, false 


an i 
ment component in the numerator ape f 
ave a noncentral F’ distribution- pse vel g 
distribution, the probability of an oa nde 
ratio exceeding F, is equal to the paw ii 
lying to the right of Fa. The probs’ 
making a Type II error (failing to E 
When false) is equal to the area UPC" go 
the left of Fa. In order to calculate 


MAXIMIZING POWER RELATIVE TO COST 


of a test, we need to know the distribution F’ 
and the critical value Fe. The power of the 
test is then the integral of F’ from Fa to zero. 
The distribution F’ depends on degrees of 
freedom (df) and on the noncentrality pa- 
Tameter A (Fisher, 1928; Patnaik, 1949). 
Various writers have presented power function 
curves (Feldt & Mahmoud, 1958; Pearson & 
Hartley, 1951), but in general these are not 
Satisfactory for the applications considered 
here. Power function values calculated for an 
extended set of parameter values are presented 
m Table 1. The values in the body of this 
table are the power of F tests for a .05, df Nı 
and V2, and noncentrality parameter À. 
he noncentrality parameter à can be 
calculated directly from estimates of com- 
Ponents of variance in the numerator and de- 
nominator of the F ratio for the test under 
Consideration. From the general form of 
quation 1, the noncentrality parameter is 
as follows: 


` = kmoè/oè, [3] 


oe k is the number of treatment means 
ie into the null hypothesis, a? is the 
tot, eon of true treatment means, and mis the 
EM coefficient associated with of in the 
erro 5), and where o,’ is the total appropriate 
ii variance. In the general case, the de- 
Ominator of the F ratio will contain several 
ee Ponents of variance—that is, the total 
ei Variance, «2, will be composed of several 
tice eiis: In a similar manner, the coeffi- 
ass m may be a product of several coefficients 
Ociated with the treatment component m 
the EMS) of the numerator. For example, in 
cen Wo-way analysis of Equation 2, the non- 
trality parameter \ would be as follows: 


A = kbno è/ (cè + noan’). [4] 


co Similar manner, the N value for og ga 
rm, ns tn the numerator k times the trealmen 
of including coefficients, from the numerator 
the expected F ratio, and the denominator 0 
i all components of variance — 
k i S appearing in the E(MS)e The coefficien 
S the number of means entering into the 
YPothesis, In the case of a test of interaction, 
Will be the number of celi means involved. 
taining Components of Variance Estimates. 
never previous research results in an 
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analysis of variance summary table, it is 
possible to compute estimates of all compo- 
nents of variance appearing in the E(MS). The 
first step is to write the complete set of E(MS) 
for the particular experimental design. Sup- 
pose, for example, that the design is a hier- 
archal design in which there are m independent 
normally distributed observations on each S 
and in which there are 7 different Ss in each 
treatment group. The E(MS)s for this design, 
as given in Winer (1962, p. 185) and in other 
sources, are as follows: 


E(MS)) = cè + mo? + nmo? 
E(MS,) = cè + mo? 
E(MS,) = cè. 

For any table of E(AZS)s, the estimates of 
individual components of variance can be 
calculated from the MSs obtained in an analysis 
of variance. The best unbiased estimate of the 
error component is the MS for error: 


MS, = cè. 


Substituting this value into each equation 
containing only this value plus one other com- 
ponent, an estimate of the component of 
variance associated with the next level can be 
computed. In the hierarchal design of this 
example, the component of variance associated 
with variation of observations within Ss can 
be estimated by substituting MS, = cé into 
the equation for E(MS;): 


MS, = cè + mo? 
MS, = MS.+ mo? 
MS, — MS. 
Ca] 
m 
tituting the two estimates into each 
ng these two plus one other 


yields estimates of components of variance at 
the next level. In the example considered here, 
the component of variance due to treatment 
differences can be estimated as follows: 


MS, = cè + mo? + nmoe 


MS, — MS. j 
MS, = MS.+™\ — pm + ume? 


MS, = MS, + nmo? 
MS;— MS, 
nm j 


Subsi } 
equation contain 


of 
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For any experimental design, the estimates 
of components of variance can be solved for in 
a similar manner. Starting with the basic 
error component, substitute the observed error 
mean square for its expected value. Succes- 
sively, substitute previously obtained estimates 
of components of variance into higher level 
E(MS) equations in such a way that, at each 
step, there is one unknown variance compo- 
nent to be solved for. It is entirely conceivable 
that, in planning a new experiment, some 
components of variance to be present in the 
new experiment may be estimated from one 
previous analysis and other components from 
another. 

Estimating Component of Variance for Hy- 
pothesis. One of the most difficult problems in 
use of the procedure described herein is that 
of estimating the magnitude of the component 
of variance due to treatment. Three specific 
suggestions can be made: Results from other 
studies can be used to estimate the likely 
magnitude of true treatment differences for 
the criterion measure involved. For example, 
where new psychiatric drugs are being eval- 
uated using standard measurement techniques, 
one might require that the power be sufficient 
to detect treatment differences comparable in 
magnitude to those previously observed with 
other active drugs. Secondly, one may actually 
attempt to estimate the probable range and 
distribution of treatment means. Where one 
can hypothesize the order of treatment means, 
the probable value of the largest and smallest 
mean can be estimated and other means in- 
serted at intermediate points. The component 
of variance due to treatments can be estimated 
using Equation 5, where B; is the deviation 
of the individual treatment mean about the 
grand mean. Finally, 


an. one can simply decide 
on the minimum deviation of an individual 
treatment mean about the grand mean which 


should be considered important to detect. 
The component of variance due to treatments 
can then be estimated from the minimum 
meaningful deviations using Equation 5, This 
latter procedure has been recommended by 
both Patnaik (1949) and Winer (1962). 


k 
of = È Be/k, [5] 
where B; is the hypothesized deviation of the 
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is the 
individual treatment mean, and & 
number of treatments. eon 
The noncentrality parameter \ can se or 
puted from the component of a. in the 
treatments, of, and the componen 


‘on 3: 
E(MS), using the general form of Equatio 
d= kmo?/oe. 


ry om- 
Alternative Experiments and Cost. F Be ious 
ponents of variance associated wi which 
error factors are empirical realities M other 
the investigator must contend. On ith thes? 
hand, the coefficients associated MS)S are 
components of variance in the BC under 
parameters of the experimental ian 
control of the Æ. The coeficient t deter 
represent the number of independen q repre” 
minations on each S. The coeficient ’ offi- 
sents the number of Ss per group. f 
cient r may represent the number © atters © 
a control variable. These are all m 
experimental design. amine the 
One of the most direct ways tO pee g 
effects of variations in the magn! er hef 
coefficients in the E(MS)s is to cons! the 
ratio. The estimated F ratio, forme the 


the magnitude of the estimated F A y 
to correspond to an increase in pon E 
shall see in a later section, the re E Ms), 
estimated F ratio (derived from the critic 
ne components of variance) pA ý 
value Fe provides a convenien a 
index of faiths power for comparing T 
native experimental designs. The ie 
estimated F ratio relative to Fe ` pto 
will be the area under F” and to the meavol 13 
Consider an experimental desig? z SU w 
m independent measures on each of tals 
in each of p randomly selected hos] 
for k different treatment groups: as 2 145) 
assumptions, this could be viewed po 
archal design having the followin’ 
(Winer, 1962, p. 188): putt? 


EMS) = oè + moè + nmow pi 2 
E(MS)) = oè + moè + 0 
E(MS,) = cè + mo? 
E(MS,) = cê. 


» Me 


MAXIMIZING POWER RELATIVE TO COST 


The proper F ratio for testing the signifi- 
io of treatment differences would contain 

€ following components of variance and 
coefficients: 


2 
oè + moè + nmo + pram? 
Fe : P [6] 


oè + moe + ume)? š 


a nt — F ratio can be expressed in alternate 
dino Dy dividing both the numerator and 
obvi minator by pnm. In this form, it becomes 
th ous that for given values of cë, os and o? 
€ estimated F ratio depends upon ™, 1, 

and p: 
2 


of 


oe on 2 
= mon Tae Pp as 


ce sè on 


pnm pn $ 


n = increase in any one of the coefficients ™, 
A $ has the effect of increasing the numer- 
nomi of the estimated F relative to the de- 
in n inator, but not at the same rate. Increase 
s ie no matter how great, can only affect F 
Si decrease in the first term of both 
chad and denominator. Increase in z 
two to affect F through reduction in the first 
Mile in both numerator and denominator. 
ing if the number of Ss in each hospital were 
teased without limit, the term associated 
Not true variations among hospitals would 
req e affected. An increase in $ tends to 
“ce all three terms common to both the 
vie and denominator. As p increases 
the Out limit there is no limit on increase 1 
i ee mated F value. Other things equa, 
Woe in the largest unit (hospitals instead 
in p oe will effect the greatest Increase 
Tneveace what extent one type of unit should be 
Upo ased as opposed to another will depend 
coy the relative magnitudes of the various 
ets of variance and upon the relative 
S Of adding the different kinds of units. 

ang VEN the estimated F ratio of Equation 6 
oe “Stimates of the components of variance 
owt? oR, and of = B2/k, it is possible 
an calculate A for any combination of 7, 1, 
? and then to look up the approximate 
er of the test in Table 1.° Under restrictions 


p . [6a] 


Pow, 


3 
ingethe values in Table 1 were calculated by numerical 
tio Station in two parts of the noncentral F distribu- 
unction using Simpson’s rule. Some 30 values 
a wide range of m, mn ^ and F’ have been 
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of limited resources, it is possible to compare 
the power of alternative tests of the same hy- 
pothesis and to choose that allocation of re- 
sources which will yield maximum power. The 
noncentrality parameter \ for the hierarchal 
design of Equation 6 is as follows. 

[7] 
where the numerator is & times the last term 
in the numerator of Equation 6 and the de- 
nominator is the denominator of Equation 6. 

If there were no practical limitations on 
available resources, the power of a test could 
be increased to unity by increasing experi- 
mental units represented by p, n, and m. In 
actuality, time and resources will not permit 
this. Each sampling unit (duplicate measure, 
individual-within-hospital, hospital, etc.) usu- 
ally has a cost associated with it. Usually, the 
cost associated with one type of sampling unit 
will be different from the cost of another. It 
is this fact that frequently causes one alloca- 
tion of resources to be superior to all others in 
maximizing power relative to cost. 

Choice Among Permissible Experiments. A 
“permissible” experiment is one which can be 
conducted within limitations of resources 
available to Æ. Since power increases mono- 
tonically with increase in number of sampling 
units, only those possible experiments which 
utilize all allocated resources will be con- 
sidered “permissible.” For example, suppose 
that $2,000 were available for an experiment. 
The set of “permissible” experiments would 
include all relevant experiments costing ap- 
proximately $2,000. 

Suppose that a researcher has $2,000 to 
study the effects of a new tranquilizing drug 
fatty acids (FFA) following 
der stress. It costs $40 to train 
each S up to the performance criterion. Each 
test trial, 1 week apart to eliminate any serial 
correlation, costs $10. Each laboratory — 
mination of FFA level from a blood saci 
costs $2. A hierarchal design 1s considered in 
which m independent FFA determinations mi 
to be made from the blood sample taken fol- 


A = kpnme?/(c2 + mo? + nm), 


on serum-free 
performance un 


k ainst values computed using tables of the 
PET B-function as described by Pearson (1934) 
and have been found to agree to three places. Thus, 
the two-place values reported in this table can be 
accepted as accurate to the second decimal place. 
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TABLE 2 


ANALYSIS OF VARIANCE FROM Previous STUDY 


Source SS of MS a? 
Subjects 1,917,947 | 23 83,389 5,683.5 
Trials 2,730,312| 72 37,921 | 18,809.3 
Duplicates 29,036| 96 302 302.5 


lowing each of p independent trials for each 
of n randomly assigned Ss in k = 2 (drug and 
placebo) groups. This is a three-level hierarchal 
design similar to the one considered in the 
previous section. The estimated F ratio is as 
follows: 


oè + mo? + pmo? + npmoè 


F= 
oè + mof + pmo? 


[8] 


Following the general form of Equation 3, 
the noncentrality parameter \ necessary to 
estimate the power of the test is the ratio of k 
times the right-hand term in the numerator 
of F to the denominator of the F. 


` = knpmod/ (cè + moè + pmo?), [9] 


where & is the number of treatments, n the 
number of Ss per group, p the number of 
independent trials per S, m the number of 
independent FFA determinations per trial, and 
where eê is the component of variance due to 
differences in duplicate measures on the same 
trial; o/ is the component of variance due to 
intraindividual variability between trials, o 
is the component of variance due to true long- 
term differences between Ss within groups, 
and of = X B#/k is the hypothesized con. 


ponent of variance due to true treatment 
differences, 


A search of the literature 
the magnitudes of com: 
associated with measurement error, cê, intra- 
individual variability across trials, on and 
with average S differences within groups g2 
(Hollister & Overall, 1964). From Table 2 ; the 
following values are obtained: c = 302.5 
af = 18,809.3, 0. = 5,683.5. It is obvious 

from the summary table that the component 
of variance due to within-trial errors of meas- 
urement is negligible, cê = 302.5. Only values 
of m = 1 and m = 2 are considered in the list 
of permissible experiments, since it is obvious 
that further increasing the number of duplicate 


yields estimates of 
ponents of variance 
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determinations would be a waste of resour® 5 
and should reduce overall power 1m testing 
treatment differences. ‘sible for 

All experiments considered permissib’¢ p- 
testing treatment differences within w 
striction of $2,000 total expenditure are r 4 
in Table 3. At $40 for training, $10 oe 
single trial, and $2 for each laboratory og, 
termination, the minimum cost per 5 1S ESS 
This means that the maximum nee 5 
per group will be 19 if one is restricte «yu 
total cost of less than $2,000. The mim". 
number of Ss which will yield necessary ie per- 
group error estimate is 2 per group. 


sen 

missible experiments in Table 3 were cio 
TABLE 3 
A 
k r wi g X Power | g(P)/Fa 
L 

9 
2] 2/1] 38] 162 12 9 
2] 2) 2) 32] 159 12 28 
2) 3) 1] 24| 231 21 28 
21 3| 2/20] 226 21 AL 
2) 4) 1/17) 204 29 41 
2) 4] 2/15] 288 29 ‘52 
2) 5/1113] 3.49 38 ‘51 
21 S| 2] a| 337 35 ‘60 
2) 6/111] 10] 395 43 39 
21 6) 2] 9| 385 42 ‘67 
21 7/11] 8] 433 AT ‘65 
21 7) 2) 7| 417 46 "13 
2] 8/41] 7) 47s 51 A 
2) 8|2] 6] 452 50 5 
2) 911] 5] 473 52 5 
2) 9) 2] 5| 475 52 82 
2/10} 1 5 5.26 56 ih 
2) 10] 2] 4| 479 53 83 
2) Il a | wl gos 37 16 
2 uiz 3 4.58 52 Bt 
2 12 1 3 4.98 55 gh 
211212] 31 doo 55 ‘81 
2/13) 1] 3] 539 58 14 
2/13) 2) 2| 428 50 18 
2 14] 1 2 4.59 53 8 
2 14] 2 2 4.61 53 82 
2 iSi a 2| 492 55 60 
2115/21] 1| 304 37 63 
2/16} 1] 4] 393 39 63 
2) 16) 2) 1] 3% 39 65 
2)w7lal ail 343 40 66 
2/47) 2] i| gas 44 68 
2) 1% 41 1 3.63 46 68 
2) 18) 2) i| 365 46 a 

21991 a 1 a) Bias 48 ie 

t PAA 

n 1S i pa this table, k is the number of treatm uprlo 


f 
mber of S: he number © gent ys 
per S ea Rer S each trial, § ie the mumber of indep ated 
Equation 9, | the noncentrality parameter © 


MAXIMIZING POWER RELATIVE TO COST 


by Systematically varying z between 2 and 19 
and m between 1 and 2. With $ = 2 treatment 
Stoups, the number of trials per S was calcu- 
lated as the integer portion of 


$ = (2000 — 40nk)/(10nk + 2umk). 


Consideration of expected elevation in 
Serum-free fatty acid levels suggested that a 
erence of 100 units between drug and 
Placebo would be meaningful and important 
© detect. The component of variance due to 


“eatment factors was computed using Bı 
5B; = 100/2: 


oa = (50? + 502)/2 = 2500. 


R rom values m, n, p, and k and from 
= ses eê = 302.5, of = 18,8093, of 
a and o? = 2,500, the values of non- 
emis Parameter \ were computed for each 

Ih ssible experiment by use of Equation 9. 
Ny S example, the df are Ny = k = 1 and 
Va io aaa 1). Entering Table 1 with Ns, 
Permisi A, an estimate of the power of each 
'Ssible allocation of resources was ob- 

ft aini nder the conditions imposed by costs 
tions ning, testing, and laboratory determina- 
Of eg and in view of the relative magnitudes 
Vario; g Ponents of variance associated with 
test op  OUrces of error, the most powerful 
Would treatment differences in this example 
three “Pear to involve 13 Ss per group with 
laborate “pendent trials each and only one 

Th oy determination per trial. 

Powe, poue 2 are presented graphs of the 
Yong , °° Single and duplicate FFA determina- 
tg. GY Dumber of trials per S with number 
post spo ermined by the restriction that total 
tat een Not exceed $2,000. It is apparent 
Produce tions in the allocation of resources 
St of p Marked differences in power of the 
Sin, eatment differences. In this example, 
“ase of Practical problems, the differential 
Brin, Various sampling units renders one 
R tt ie allocation of resources most power- 
te the G SO Interesting to note that, because 
Ania 8h reliability of the measurement 

iri Within trials, the allocation of re- 
ty, sist two determinations per trial (m = 2) 
rf alloc, ently less powerful than correspond- 
hos tl ations of resources such that m = 1. 


| he Simple error of measurement been 
| 
| 
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0.70 


0.60 O Indicates m = 1 


@ Indicates m = 2 


0.50 


Power 


5 10 1 2 2 30 35 4% 
Number of repeated determinations per subject 


Fic. 2. Power of F test for various numbers 
of trials per S. 


greater, duplication of lab determinations at 
$2 apiece might have proven advantageous, 
The Zero-Overhead Case. An important 
question in psychological research is con- 
cerned with whether or not to test each S more 
than once in order to increase the reliability 
of measuring some highly variable perform- 
ance. If there is a fixed cost associated with 
each test, but no difference in cost associated. 
with whether the testing involves the same 
or different Ss, then the answer is that it is 
unquestionably better to use more Ss and 
test each one only once. This is true no matter 
how unreliable the measurement involved. 
Consider an investigator interested in evalu- 
ating the effect of a new tranquilizing drug 
on serum FFA level, as in the former ex- 
ample, except that there is no separate cost 
for training. Each test trial costs $10, whether 
on the same or different S. Since it is recognized 
that laboratory errors are negligible, no dupli- 
cate determinations from each trial are con- 
sidered. Assuming independence and normality 
within Ss, a simple hierarchal design is 
appropriate : 
oè + mod + nmo? 
~ stm 


P= 
The noncentrality parameter A can be com- 
puted as follows for testing treatment effects: 
d= knmoè/ (cè + mo). 


Suppose that $800 were available for the 
research. At a minimum cost of $10 per S, the 
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TABLE 4 

k m m ? `A Power 

20 1.51 12 
A > i 18 1.48 12 
5 2 1 16 | 145 12 
A 2 1 14 1.42 12 
5 3 i 12 2.06 20 
2 4 1 10 2.63 27 
2 5 1 8 3.10 33 
2 6 1 6 3.38 36 
2 8 1 5 4.21 AT 
2 10 1 4 4.78 53 
2 13 1 3 5.39 58 
2 20 1 2 6.56 71 
2 40 1 1 8.07 82 


— is table, k is the number of treatment groups, n» 
i: orre me a per group, m is the number of duplicate 
Ai ee per S each trial, p is the number of independent trials 
per S and A is the noncentrality parameter calculated using 
Equation 9. 


maximum number of Ss per group (drug and 
placebo) would be 40. The minimum number 
of Ss per group would be 2, with 20 trials per S. 
The set of permissible experiments includes all 
combinations of m and n between these ex- 
treme values such that total cost is held to 
$800. These permissible experiments are listed 
in Table 4. The general conclusion is that, if 
there is no additional special cost associated 
with the use of different S for each determina- 
tion, the most powerful test results from using 
a maximum number of Ss with only one de- 
termination each. This conclusion is valid 
no matter how unreliable tl 
involved. 

Use of Î in Estimatin g Power. The power of 
an F test depends upon the proportion of the 
F’ distribution lying to th 


i he right of Fe (see 
Fig. 1). While integration of the noncentral 


F' distribution provides a problem for the re- 
searcher without a computer handy, it is 
obvious that a central esti 


mated value in the 
F’ distribution provides a good indication of 
the general location of that distribution, We 


will define the estimated F ratio as the ratio 
of E(MS)s for treatment and error. The farther 
that the estimated F ratio exceeds Fa, the 
larger will tend to be the proportion of the F’ 
distribution which will be found to lie beyond 
Fa. Thus, the ratio E(F)/F, provides an ap- 
propriate index of the power of any F test. 
The Ê ratio can be estimated from the 
E(MS)s, and estimates of components of vari- 


he measurement 
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ance included in them: 


oè + mae + pmae + npma . 110) 
oè + mof + pmo? 


; om 
The estimated Ê ratio can be pn E 
E(MS) values tabled for most S in a pre 
designs. In the example considered 302.5, of 
vious section, estimates of ce = © = 2,500 
= 18,809.3, c = 5,683.5, 


ies 


and a ower 
were obtained from the eratan a A a 
of tests representing different w Far 
resources are presented in T able Jefined in 
same set of permissible experimenta i mated 
terms of m, n, and p, the values © ting iP 
Ê ratios were computed by subat specife? 
Equation 10. The ratio of / to Fa ae expeti 
df was computed for each permis! the right 
ment, and the results are entered in Jot 0 
hand column of Table 3. A gee ower 
relationship between calculated 3, The Jine? i 
and F/F, ratio is shown in Figure 3 calcula d 
correlation between power of Tai 4 
as integral of noncentral F’ distri = pi 
relative power as estimated by Í/F a5 

to be .988 in this example, These vroximal ie 
gest the usefulness of the #/Fa PP” issib f 
as a basis for comparing alternative z ection © 
experiments. As can be seen by InSP® << ap 


sults 5°? 


s 
Table 2 Ce, y 
the right-hand column of Table resulted * 
proximate procedure would have 
0.90- g 
0.80 
4 £ 
0.704 5 o? 
0.60- °g 
© cond 
i 0.50. °° 
= 0.404 o 
w 
0.304 
o 
0.204 
0.10 
8 “0 
0.60 © 


0 
0.10 0.20 0.30 0.40 0.5 


Power f petwe®” 
Fic. 3. Scatter plot of relationship 
power and E(F)/Fa index: 


mS 


MAXIMIZING POWER RELATIVE TO COST 


selection of an experiment involving 13 Ss per 
Stoup with three independent test trials each. 

€ end result is identical to that obtained 
Y calculation of exact power values. In cases 
Where df or X values are not adequately ap- 
es in the tables, the P/F, approxima- 
a Should provide an adequate basis for 

Paring alternative permissible experiments. 


c sala > 
etna for purposes of designing experi- 
ents, rather crude approximation in estimat- 


Ng relative 


Buideline for power should provide a useful 


@ rasan j avoiding unfortunate allocation 
combine and for choosing an advantageous 
ation, 
ae Figure + are presented graphs of the 
Per § T plotted against number of trials 
> -nese curves can be compared directly 
Power curves presented in Figure 2 
tials ae 5 data. The curves peak at three 
a single ot and the consistent superiority of 
ticular determination per trial in this par- 
Houses Gee is apparent. Comparison of 
the Sin a 4 makes obvious the utility of 
Values rad F/Fa ratio where tabled power 
re not available. 

F ty Formulas for Optimal Design. Sev- 
Of a eee have worked on the problem 
a simple yA of resources in an attempt to find 
Irect way to calculate the optimal 

each type of experimental unit. As 
we can find, others have not actually 
the ogical, but complex, criterion of 


a 


Useq 


@ Indicates m=2 
O Indicates m= 1 


wn 
EPI Fee 


5 10 15 20 25 30 35 40 
Number of repeated determinations per subject 


F 4 
78: g, E(P)/F, index for various numbers 
of trials per S. 


349 


power as a basis for optimizing. One approach 
which may yield results highly similar to those 
based upon maximum power is that of mini- 
mizing the total error variance (Brooks, 1955). 
For a given true treatment difference, reduc- 
tion in error will increase the estimated F 
ratio. 


oè + moe + pmo? + npmoè 


F= 7 2 
oè + mot + pmo? 


The total cost of the experiment can be ex- 
pressed in terms of the costs of the different 
kinds of experimental units: 


C = nC + pnCe + mpnCs 


Using methods of calculus, it is possible to 
solve for values of $, m, and x which will 
minimize the denominator of F relative to cost. 
It will be noted that the results from this ap- 
proach do not take into consideration the 
magnitude of the hypothesized true treatment 
differences, df, or alpha level. Hence, the 
minimizing of error variance alone will not 
always result in the most powerful test. It is, 
nevertheless, a useful approach with which the 
psychologist should become familiar. 

We have attempted an analytic solution for 
the problem of maximizing the ratio of esti- 
mated / to Fa subject to fixed cost. The La- 
grange method was used and a solution was 
found for the problem represented in Table 3. 
We have concluded that the same solution 
can be obtained more easily through tables 
to evaluate power of alternative permissible 
experiments, or through comparison of //F, 
ratios for alternative permissible experiments. 
Construction of a table of alternative experi- 
ments, such as Table 3, has the advantage of 
displaying changes in power for comparison 
among various practical alternatives. In some 
cases, reasons other than economic may sug- 
gest the use of a combination which is not the 
single most powerful. Through the tabling of 
alternative permissible experiments, a good 
alternative can be chosen. 


REFERENCES 


Brooks, S. The estimation of an optimum subsampling 
number. Journal of the American Statistical Associa- 
tion, 1955, 50, 398-415. 

FEDDERER, W. T. Experimental design, New York: 
Macmillan, 1955. 


350 


FELDT, L. S., & Manaup, M. W. Power function 
charts for specification of sample size in analysis of 
variance. Psychometrika, 1958, 23, 201-210. 

Fisner, R. A. The general sampling distribution of the 
multiple correlation coefficient. Proceedings of the 
Royal Society, Series A, 1928, 121, 654. 

Garro, J. Repeated measurement designs and counter 
balancing. Psychological Bulletin, 1961, 58, 46-54. 
HOLLISTER, L. E., & OVERALL, J. E. Sources of varia- 
tion in repeated measurement of plasma free fatty 
acids in rested fasting subjects. American Journal 

of Clinical Nutrition. 1964, 15, 149-157. 

Lınpgounsr, E. F. Design and analysis of experiments in 
psychology and education. New York: Houghton 
Mifflin, 1953. 


JOHN E. OVERALL AND SUDHIR N. DALAL 


PATNAIK, P. B. The non-central chi-square and ee 
tributions and their applications. Biometrika, 1 
36, 220-232. wer 

Pearson, E. S., & Hartiey, H. O. Charts of the po 
function for analysis of variance tests, derive 951, 
the non-central F-distribution. Biometrika, ` 
38, 112-130. clion. 

PEARSON, KARL. Tables of the incomplete bela fi 193 j 
Cambridge, Eng.: Cambridge Univer. Press, Psycho 

Satrertuwarre, F. E. Synthesis of variance. s) 
metrika, 1941, 6, 309-316. im 

Winer, B. J. Statistical principles in experi 
design. New York: McGraw-Hill, 1962. 


ental 


(Received January 22, 1965) 


Psychological Bulletin 
1965, Vol. 64, No. 5, 351-364 


DEFINITIONS AND METHODOLOGY IN THE 
STUDY OF AGGRESSION* 
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The literature on aggression and hostility is viewed in the light of 2 major 
criticisms: (a) Definitions of terms with obvious denotations of hypothetical 
constructions are attempted in purely operational terms, (b) The phenotype 
of aggression has impaired a sound investigation of the many genotypic de- 
terminants of aggressive behavior. Present trends of studying aggression as an 
end result of a variety of stimulus, dispositional, and reward variables are 
considered, and some amplifications are suggested. 


The issue of aggressive tendencies and ag- 
8tessive behavior has long been a source of 
Particular interest to psychologists and es- 
Pecially in the last few years has produced 
“everal exhaustive and insightful treatments 
(Berkowitz, 1958, 1962, 1964; Buss, 1961; 
in vel 1959; Pepitone,*1964). Nevertheless, 
in Spite of the great need for, and interest 
det a more complete understanding of the 

<tminants of aggression, we are still on 
aie Uncertain ground as far as relevant 
®oretical interpretations are concerned. 


F 5 r 
TUstration-4 ggression Hypothesis 


h The Original frustration-aggression (F-A) 
g ethesis (Dollard, Doob, Miller, Mowrer, 
condi” 1939) defined frustration as “that 
S ition which exists when a goal-response 
«S interference [p. 11],” and aggression 
an 22 act whose goal response is injury to 
11) St8anism (or organism-surrogate) [p. 
One S hypothesis then asserted a one-to- 
sion ction between frustration and aggres- 
in la although, and this is often overlooke 
that" Criticisms, the authors did not argue 
Obser aggression in such cases is always 
the able. N. E. Miller (1941) rephrased 
as ,Ptesumed relationship in such a manner 
tional, dmit, in place of the heretofore opera- 
to . Y definable aggression, an “instigation 
bein: 8gression” which can be conceived as 
hier either so weak or so low in response 
Father Y as to be undetectable. Brown and 
tion © (1951), in turn, modified the defini- 
i of frustration from its original meaning 
at = Present paper was completed with the help 
aq, National Science Foundation Research Grant 
"gma Xi Grant-in-Aid. 


of “interference with a goal directed activity” 
to a hypothetical, internal arousal response 
to certain stimulus situations. This response, 
apparently largely autonomic in nature, could 
then serve as a mediating stimulus for ag- 
gressive responses. The objective definition of 
frustration has been questioned repeatedly 
(see Maslow, 1941; Pastore, 1952; Rosen- 
zweig, 1944). Among more recent writers, 
Berkowitz (1962) and McNeil (1959) con- 
tinued to lean toward the F-A hypothesis, 
McNeil (1959) continued to view aggression 
within the framework of the F-A hypothesis, 
though with some modifications. He drew 
freely upon psychoanalytic concepts and 
learning principles, but his overview aimed 
more at description than at theoretical analy- 
sis. In his recent book, Berkowitz (1962, p. 
30) first noted the failure to define frustra- 
tion properly, then (p. 62) approvingly cited 
McDougall (1926) and viewed having one’s 

toe stepped on as a frustration of the impulse 

to self-assertion. He also noted (p. 138) that 

aggression need not always be the dominant 

response to frustration but may be “inhibited” 

under certain conditions. 

One is thus forced to conclude that the 
conceptual value of even the modified ver- 
sions of the hypothesis advocated by McNeil 
and by Berkowitz is no longer discernible, 
since both its independent and dependent 
variables are left so protean as to be refrac- 
tory to operational definition, a sure sign that, 
with due recognition of its influential role 
in experimental research, it has outlived its 
usefulness. This conclusion is shared by such 
diverse sources as Seward (1945a), Men- 
ninger (1942, p. 296), and Buss (1961), 
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who arrived at it by the different routes of 
animal behavior, psychiatry, and social 
psychology. 


Physiological and Genetic Determinants of 
Aggressive Behavior 


The attempts to subsume aggression under 
one global theoretical edifice have assumed 
other guises. Experiments by Beach (1945) 
and Beeman (1947) attested to a direct 
relationship between hormonal balance and 
fighting behavior, but Scott (1958) reported 
successfully training mice not to fight and, 
more generally, produced convincing evidence 
against the so-called “fighting instinct” in 
mammals. Bevan, Daves, and Levy (1960) 
indicated that testosterone concentration is 
less effective in producing fighting behavior 
in rats than previous fighting experience, 
even though the hormonal composition ap- 
pears to weigh heavily in the initial shaping 
of response patterns. Clark and Birch (1945) 
noted that physiological explanations are of 
little value in accounting for complex human 
behavior. Scott and Fredericson (1951) ex- 
tensively documented their argument that 
hormonal factors are irrelevant in fighting 
behavior subsequent to its acquisition, Even 
Tinbergen (1953) found no grounds for 
postulating a general fighting instinct among 
animals. z 

We can therefore concur with McNeil 
(1959, p. 198) in the well-founded conclusion 
that the explanation of aggression in humans 
is not furthered by the invocation of physio- 
logical or genetic factors, 


Redefinition of Aggression 


The term aggression has not always been 
used unequivocally, Even if we disregard the 
distinction between aggression and “aggres- 
sive behavior” and use both terms synony- 
mously to denote only overt behavior, several 
points remain unclear. First of all, traditional 
definitions of aggression as a response having 
for its goal the injury of a living object are 
not necessarily applicable to so-called instru- 
mental aggression, or behaviors aimed at re- 
moving obstacles between the aggressor and 
a goal, where injury may result only inci- 
dentally. Ambiguity exists within the nar- 
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rower traditional definition. Buss (1961), a 
instance, used a somewhat similar omer 
—‘“a response that deliver noxious stimul to 
another organism [p. 3]”—but soon ran E 

a contradiction when (p. 32) he consider k 
Gellerman’s (1956) word association e 
and his own scrambled sentences task re 
measures of aggression, though both tae ‘eo 
clearly “intransitive” in that they na ure 
direct target. Thus, these devices me 
what might more appropriately be ae 
46 “potential aggression” or “aggressive obe 
ency,” which we might define as the Pr ag 
bility for a given individual to emit ee 
gressive response in a given situation. A 4 to 
lar question might be raised with sna 0 
various well-established fantasy measur aWv- 
aggression, where noxious stimuli are they 
ered in a purely autistic manner in that © pi 
not only do not result in actual injury edo 
object (nor in its retnoval, where obstru pre 
removing aggression is concerned), v pility 
sumably have no initial subjective PtOP* ch 
of reaching the object. Questionnaires, “pout 
aim at assessing a person’s opinions cal Í 
another individual are equally equiv’ y a 
their meaning. Ts it justifiable to clas f tne 
aggression an opinion which, in terms cœ % 
respondent’s perception, has no s a pre 
affecting the target of his remarks? pa sul 

occupation with violent materials ™¢? put 
by, say, a sentence completion task, ™ ia “ten 
does not necessarily denote aggress Yo of 
dencies, that is, an increased likelibo? of 
committing aggression in the present of 
suitable object, Buss classified thw? gsi 
uncompleted aggressive acts as 28% ca 

but it might be argued that this class atent 
does not account for acts whic 


the delivery of a noxious stimulus as # 


sai a 
y the aggressor is sufficien gre 
: it seems possible to view ind E 
Sion as a pure response, similar tO “spis g 
miming, and measure its intensity: ) Le 
the view proposed by Buss (P. 3 is qv 
decried the inclusion of “intent” i" p 


first sight, 


} 
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for a purely behavioral definition. But here 
too problems of definition arise: When does 
the sudden extension of the forearm have the 
charactersitics of aggression as opposed to, 
say, physical exercise? And can mere verbal 
Production of a class of words classified as 
aggressive be taken as indicative of aggres- 
Sion, unless inferences about intent and ex- 
Pectations are made? Feshbach (1964) also 
Was careful to separate the mere expression 
of anger (and, presumably, derogation) from 
aggressive behavior. The facilitation of ag- 
8tession by anger, on the other hand, is 
readily explained by conditioning principles. 
Little attention has also been paid to the 
question of the victim’s awareness that he 
's the object of an aggressive act. Where the 
Boal of the aggressive act is injury to the 
Victim and such an act is completed success- 
ully according to aggressor’s evaluation, is 
Such injury rewarding even if the victim does 
Rot perceive the source of his injury, or 
Pethaps does not even realize that he has 
eae injured? A problem which follows di- 
fe from the issue of effectiveness of ag- 
cet Sion and awareness of the victim (as per- 
ved by the aggressor) is that of measuring 
oby tensity of the aggressive act. The most 
lous indicator of intensity is the strength 
the Tesponse emitted by the aggressor. 
ut where the aggressor perceives two or 
Pi Objects of aggression as differentially 
eee oe and, wishing to obtain equivalent 
may a gauges his aggression accordingly, it 
aggre e misleading to maintain that he was 
Taa more violently against a more 
verful than a less powerful object. , 
en a; the definition of aggression is un- 
Teles oere an aggressive act is committed 
of q tly, under orders, or perhaps a sense 
ju, $Y- Granted that much malice and in- 
D T rationalized by the attacker as just 
to ¢ Ment or an act of obligatory adherence 
Sasa norms, it would be begging the 
Committ to assert that aggression is never 
a ed without some satisfaction to the 
essor arising out of injury to the victim 
Dlan Conversely, that the interruption of such 
Stmaby aggression is always noxious. Pre- 
of Oly, Abraham welcomed the interruption 


» has been implicitly or explicitly defined 


| tio aggression against Isaac. Aggression, 


353 


in a variety of ways, none of which encom- 
pass all others. Feshbach (1964) made a 
clear and telling point for the inclusion of 
motivational factors in the definition of 
aggression. Much of the difficulty was mir- 
rored by the attempts by Buss and others to 
assign behavioral definition to what is, in 
fact, a hypothetical construct. In essence, 
then, no behavioral act as such can consist- 
ently be called aggression, unless inferences 
about intention or expectation regarding its 
outcome are made. In the body of this paper, 
the term aggression will be used to denote 
behaviors which: (a) are transitive, that is, 
are directed against some object; (b) have a 
subjective probability > 0—on the part of the 
attacker—of reaching that object, and either 
removing it from attacker’s goal path, or 
imparting a noxious stimulus to it, or both. 

This definition includes even totally unsuc- 
cessful responses (provided the attacker had 
some initial expectation of success) but does 
not include certain opinions or propensities 
as measured by questionnaire or projective 
devices, except where evidence or reasonable 
assumption indicates that respondent expects 
some injurious effect to result from his actions. 


Some Recent Theoretical Statements 


Buss (1961, p. 198 ff.) attempted a be- 
havioral approach to aggressiveness and de- 
fined the latter as the habit of attacking. He 
listed four determinants of aggressive habit 
strength, and defined them as follows: 

1, Antecedents of aggression. The number 
of times the subject has been angered, at- 
tacked, or annoyed. 

2. Reinforcement history. Degree and fre- 
quency of reinforcement or punishment. 

3. Social facilitation. Cultural and subcul- 
tural norms regarding aggression. ; ; 

4. Temperament. Such personality varia- 
bles as impulsiveness, intensity of reaction, 
activity level, and independence (tendency 
toward self-sufficiency and resistance to group 
oe, Buss offered little in the way of 
a consistent theoretical reformulation, though 
in many respects he appeared to adopt Hul- 
lian concepts. He rejected, it is true, anger 
(p. 13) and frustration (p. 20) as the specific 
drives underlying aggression, but his use of 
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“tendency” bears strong resemblance to 
“drive.” Also, he spoke of “a sharp drop in 
anger level” as “reinforcing” (p. 199). Since 
his first determinant, antecedents of aggres- 
sion, was apparently distinct from the sec- 
ond, reinforcement history, what appeared to 
be meant by the former was the total num- 
ber of “trials,” that is to say, the number of 
times an individual has been aroused. This 
variable appears related to Estes’ (1950) and 
Guthrie’s (1952) contiguity principle of 
learning, or it may belong more properly in 
the category of learned drives. The second 
determinant is synonymous with habit as de- 
fined by Hull, namely, a function of the num- 
ber of reinforced trials, 

Buss’ use of this third determinant, social 
facilitation, was similarly ambiguous. If cus- 
tomary social rewards of aggression were in- 
tended, then this variable was identical to 
the reinforcement variable with social ap- 
proval as a secondary reinforcer. On the other 
hand, if the degree of facilitation or inhibition 
of a specific aggressive act was meant. it 
would have been more proper to consider it 
an incentive variable, analogous to Hull’s K, 

An insightful and rigorous treatment was 
offered by Pepitone (1964). As did Buss 
(1961), he rejected the circularity of the F-A 
hypothesis and pointed to the importance of 
an evaluation of the stimulus. Pepitone, how- 
ever, went beyond Buss and examined the 
cognitive implications of a potentially ag- 
ression-producing situation. He began with an 
analysis of the “threat” stimulus postulated 
earlier by Maslow (1941) and Rosenzweig 
(1944). Such threat may be subdivided into 
the thwarting of various needs such as status 
or affiliation, absence of cognitive balance or 
congruence, or, finally, 
uniformity, 

Pepitone concluded that even this finer 
grained analysis does not preclude equivocal 
interpretations of extant experimental data 
and proceeded to report a series of experi. 
ments in which such specific potential ag- 
gression inducers as boastfulness and threat 
against the subject’s self-evaluation on the 
part of an instigator were studied in inter- 
action with that person’s perceived actual 
worth. By demonstrating the effects of such 
variables as cognitive validation and self- 


, social pressures toward 
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evaluation conflict, Pepitone avoided the Me 
fall of a global interpretation. While his an r 
sis deliberately confined itself to a ae 
range of inquiry, it laid some wet the 
groundwork for a precise assessment © A 
several variables which determine the ee 
rence of aggressive acts. Feshbach’s (i ith 
most recent statement came to grips tent 


e i way 
studies of aggression and went @ long 


Sa þe- 

toward resolving them. He distingulsh le 
nee: Z an ; 

tween incidental, instrumental, troducing 


aggressive acts, thus clearly rent -ption 
motivation and specifically inferred int cially 
into the paradigm. He also noted how i ‘ome 
aimless acts of a motor activity may pa 
reinforced by manifestations of pain OF "while 
and thus develop into aggressive habits. ssion 
he distinguished such “expressive” 488°" i 
from instrumental and hostile aggression h. 
category could logically be dispense e of 
When injury is the accidental outer ares: 
some expressive motor act, it is not 4°" in- 
sion; when it recurs as a function ms unex 
forcement, either resulting in some in, } 
pected rewards or by observation of r i 

falls into one of the two other catego"? gute 
We might go one step further a” jon of 
that even the remaining dichotomizatt 

aggression into instrumental and bOS’” jon 


u 
troduce: ifici i de 
s artificial categories, bas Feshba 


seems to be restricted to humans, @PP°F, cto! 
be at direct variance with his thret oti of 
theory of aggression, There is no 4 ress, 
parsimonious reason why hostility id 4 
dered, injury-directed aggression peer 
viewed as being different from ordine po 
strumental aggression, What may ‘ h 
simply the reinforcing stimuli whic e w 
such behavior more or less likely i" 

ture. yy 


r 
ene!" g 
The present argument is more 8° of 2% 


applicable to most previous analyses _, 


. 5 
gression, where superficial—thous te 
Jag or tant—similarities in response one 
ed to unwarranted inferences @| inf 
antecedents, i 


; + reas 
The pitfalls of such phenotyp!e the 
(not at all a recent observation, 
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become obvious if we look at such relatively 
Simple behavior as eating. Leaving aside 
hypothalamic lesions, we may still ask whether 
the organism is eating because it has been 
deprived of food generally, or perhaps just 
vitamin C or salt, or whether it eats to avoid 
shock, and so on ad libitum, 


í va 
“Hostility” as a Theoretical Construct 


A large number of experiments deal with 
the differential effects of instigation upon 
high-hostile” versus “low-hostile” individ- 
uals, Hostility is usually defined in a variety 
9l Ways, such as scores on one or another of 
© current hostility questionnaires, which 
Fequently show only moderate correlation, 
responses of projective or completion devices, 
end, judgments by other persons. This global 
notion of hostility was questioned by Buss 
(1962), Earlier studies by Buss and his 
associates (Buss & Durkee, 1957; Buss, 
T & Baer, 1956), attempted to parti- 
cat responses on an inventory into various 
+ ©80ries such as suspicion, indirect hostil- 
en’, cte., and extracted two factors roughly 
uivalent to hostility, a tendency to view 
the environment as inimical, and aggression, 
e tendency to inflict injury. 
raves, eminently useful approach of first 
Tae a clear distinction between aggres- 
tili and hostility and then factorizing hos- 
. Y into several components was vitiated by 
os later comparison of hostility with aggres- 
tili and by the assertion (p. 204) that hos- 
y is more specific and focused than ag- 
pression, In terms of this partition, the selec- 
We racist would be considered as hostile, but 
: Sociopath, who aggresses indiscriminately, 
uld not. Tn all, one would surmise that 
ote formulation should be able to dispense 
sist With the construct of hostility. If, con- 
€nt with a Hullian formulation, we simply 
Sume that aggression has been rewarded 
oa frequently with regard to some stimuli 
tial to others, then no postulate of differen- 
Personality characteristics is required, 
lai differential aggression is adequately <4 
one in terms of stimulus, habit strengt A 
tig ve value, inhibition, and a discrimina- 
n gradient, 
Omewhat surprisingly, as measured by 


Westionnaire, under certain conditions hos- 
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tility may be a negative predictor of the par- 
ticular kind of aggression which might be in- 
ferred from punitivity toward an offender. 
Kaufmann and Marcus (1965, in press) 
found that high-hostile college subjects dis- 
played greater leniency toward a juvenile de- 
linquent described as a college student than 
low-hostile subjects, but were more punitive 
than low-hostile subjects when the same of- 
fences were ascribed to a door-to-door sales- 
man, thus bearing out some correlational 
findings by Adorno, Frenkel-Brunswik, Lev- 
inson, and Sanford (1950) and experimental 
results by Steiner and Johnson (1963). It is 
also a common observation that highly hostile 
individuals sometimes contract allegiances 
with ties of intense personal loyalty toward 
other, perhaps similarly hostile, individuals. 
On the other hand, the tolerant liberal is often 
quite hostile toward the intolerant, jingoistic 
know-nothing. It is not clear whether what 
we mean by describing an individual as hos- 
tile is that he tends to respond aggressively 
toward individuals or groups Possessing spe- 
cific stimulus characteristics, to perceive a 
specific kind of response as socially desirable, 
or even to express his aggressive feelings with 
greater freedom than a so-called low-hostile 
person. It is not maintained that hostile, 
extrapunitive, ethnocentric, or high F-scoring 
individuals do not exhibit some behaviors 
which set them apart from their counterparts 
along these dimensions. But it is highly 
doubtful that we can rely upon these classifi- 
cations to predict targets and intensities of 
aggressive behavior, irrespective of other 
variables. 


Emotional Arousal or “Anger” 


Much confusion has prevailed with regard 
to the determinants. and properties of anger. 
The observation that aggression is usually 
accompanied by anger makes all the more 
necessary a thorough study of possible causa- 
tive, rather than correlational phenomena. It 
is usually taken for granted that anger at 
least precedes aggression and, most likely, 
causes or at least facilitates it. The physio- 
logical nature of anger and the voluntary one 
of aggression have, for instance, led to the 
assumption that anger is classically condi- 
tioned to certain stimulus situations and is, 
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therefore, largely outside the control of the 
person. Thus, Buss (1961, p. 102) pointed to 
the preparatory role of physiological arousal 
toward subsequent aggression. The observa- 
tion, by Sears, Maccoby, and Levin (1957), 
that anger appears to be an instinctive re- 
sponse may be of value where infants are 
concerned but, as all explanations leaning 
upon the concept of instinct for explanation, 
is of little value in understanding and predict- 
ing complex human behavior. 

We are handicapped once more by the 
absence of a unified definition. Berkowitz 
(1964) spoke of anger in the absence of any 
aggressive behavior tendency. Buss (1961) 
clearly distinguished between the emotional 
aspects of arousal and the voluntary ones of 
aggression, while Schachter and Singer (1962) 
presented impressive evidence that anger may 
be largely a cognitive labeling of physiolog- 
ical arousal. Anger, though, is a term used to 
denote a multiplicity of phenomena, which 
are not necessarily related. 

1. Anger has been defined as the diffuse 
maladaptive agitation or rage response aig: 
played by lower organisms and poorly or- 
ganized humans upon being presented with 
an unsolvable situation. This anger response 
involves no evaluative or instrumental com- 
ponents, and the aggression to which it may 
lead is usually quite Nonspecific, perhaps 
even a chance component of random þe- 
havior. The response patterns observed by 
Janis (1951), Kardiner and Spiegel (1947) 
and Berkowitz (1962), as well as temper 
tantrums (at the early, noninstrumental 
Stage), fall within this classification Fesh- 
bach (1964) spoke of this undifferentiated 
emotional arousal as an instigation to hit 
rather than to hurt. However, the consum- 
matory hattire of the hitting Tesponse is open 
to question.” 


2. Anger recurs as a result of classical con- 
ditioning. Initially unlabeled emotional 
arousal has in the past of an individual’s ex- 
perience been conditioned to stimulus situa- 


2Jt is also not clear whether Feshbach’s observa- 
tion that restraint in infancy produces a desire to 
hit (rather than to hurt) does not more simply 
represent the kind of brusque motor activity which 
has previously freed the infant from other, possibly 
inanimate, restraints. s 
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responses. 


tions regardless of subsequent a 


When such a stimulus situation are 
should the arousal, which is then labele 
anger in the manner described by getai 
and Singer (1962). Since labeling has t e 
property of providing gratifying aang 
clarity, it may on subsequent occasions 
mediated in the absence of the “model.” to- 

3. The third possibility, one that here A 
fore apparently has been largely ignored, T- 
that of anger conditioned classically tO lly 
tain preparatory responses which may eee 
have been selected independently of en ss- 
arousal. The possibility of anger as & pe 
ically conditioned response should not, P 4 
ever, be interpreted as a necessarily ont 
nent pairing of certain stimulus situations Te 
emotional arousal. Recent findings bY Liu’s 
piro, Crider, and Tursky (1964) and in nde 
(1964) two-factor theory of classical ate 
tioning suggest that anger (or ae ee 
arousal in general) is not a built-in, U4 mes 
ble response to certain stimuli, but bec? in- 
established only if its presence fulfils 2 
strumental role, ggres” 
_ 4. Anger accompanying preparatory 25° Je- 
sive responses may also be initially an has 
liberately instrumental. Where aggression y 
been rewarded in the past, the subjec ja- 
well have learned that emotional atoUS*” ich 
beled anger, facilitated or intensified rks 
aggression; he therefore reproduces tag res” 
himself into a rage—thus energizing uto 
ent aggression, This susceptibility °? gjved 
nomic state to cognitive influences "h oym, 
some support from recent studies bY p 
Back, and Bogdonoff (1964). 

5. Again a rarely considered se 
anger may follow aggression—or incre". ing 
ter aggression—as a gratifying, guilt-r feel® 
response, The subject has aggress°’ smo” 
guilty, and rationalizes by producing he i 
tional arousal, perhaps claiming that imt 
still quite upset. More generally, 8°" pav? 
lation to anger following aggression ™ jes- 
dissonance or anxiety-reducing proper! the 

Berkowitz (1962, p. 160) combing 


pure drive notions of classical F- 
and the “ 


¡plith 
poss” at 


peor 
ds 
fi 
pure stimulus” theory of Zari 964) 


Py t 
(1948) and in a more recent paPe eath, 
Saw aggression as being evoked by 2 er V’ 


with appropriate stimulus qualities eit 
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Cause the subject is angry or because partic- 
ular stimuli have acquired cue value for 
aggressive responses from him.” He distin- 
guished between anger-produced and habitual 
aggression but continued to place heavy em- 
phasis upon a preexisting state of internal 
arousal, despite rather ample evidence that 
Physiological factors play an unpredictable 
role in aggressive behavior even in animals 
(Beach, 1945; McNeil, 1959; Scott, 1958)— 
excepting, as noted earlier, the actual acquisi- 
tion of such behavior in the repertoire of some 
of the simpler organisms (Beeman, 1947). 
We have, of course, a considerable body of 
experimental literature which shows interest- 
Ing relationships between physiological arousal 
and the Occurrence of aggression (see Hokan- 
Son, 1961). The well-known study of Schach- 
ter and Singer (1962), showing the effects on 
a model upon the labeling of arousal, has 
steady been cited. What is absent, however, 
S the determination of reliable antecedent- 
consequent relations between such arousal 
aggressive behavior. 
erkowitz himself cited ample evidence 
a Physiological arousal alone is not suf- 
: tent to produce aggressive behavior where 
Mtable cue stimuli are absent, and Pepitone 
64, P. 81), by implication, considered ag- 
8tession without anger as a function of at- 
ees at cognitive validation. Evidence for 
© Occurrence of aggression without arousal 
cubes from Scott’s (1958) mice, which, ac- 
Stomed to successful aggression, displayed 
arousal symptoms preparatory to aggres- 
we to the mass executioners of an extermi- 
co on camp, who (Höss, 1963) managed to 
ap it inconceivable acts of horror without 
È arent displays of anger or other emotions. 
iat ational bluntedness or stultification in the 
See * case and in others similar to it would 
to be an inadequate interpretation, since 
of these individuals displayed a normal 
ing of emotions, from joy to rage, in deal- 
ca Vith their environment other than the 
the? Situation. One might well venture, then, 
tr arduous step of relinquishing the con- 
ees, of frustration and anger as either 
siy, essary or sufficient conditions for aggres- 
c © behavior, Such a position does not, of 
eens! deny the occasional, or even usual, 
Sence of physiological arousal where ag- 
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gressive behavior occurs. It simply maintains 
that, to paraphrase Brown and Farber (1951), 
a nonemotional theory of aggression may be 
sufficient to account for the class of behaviors 
which are perhaps most economically cate- 
gorized as a subset of learned social inter- 
actions. In this formulation, emotional arousal 
covaries highly with actual or fractional an- 
ticipatory aggressive responses, but neither 
determines nor is determined by them. Where 
present, this stage of arousal, in turn, is per- 
ceived by the person, related to other stimu- 
lus elements in the situation, and appropri- 
ately labeled in accordance with past learn- 
ing. If anger is viewed as conditioned, not 
only to consummatory aggression but to frac- 
tional antedating goal responses, then the 
fact that an aggressive response seemingly 
becomes more likely in the Presence of anger 
no longer implies causative or even facilitat- 
ing effects of the latter. 


The Notion of Displacement 


The F-A group’s original concept of dis- 
placement implied simply that an object other 
than the frustrater became the target of (un- 
deserved) aggression. Underlying the process 
were two models: one in which the frustrater 
was feared and aggression was directed against 
a safer target, either a minority group (N. E. 
Miller & Bugelski, 1948) or, as in the well- 
known parable of the satanic and powerful 
boss and the angelic, defenseless wife, against 
a handy, equally safe object. Buss (1961, p. 
61) named this phenomenon displacement of 
target and ascribed it to a lower threshold, 
permitting aggression to be elicited by a 
wider variety of stimuli. The other model 
proposed by the F-A group is exemplified by 
Miller’s (1948) well-known study in which 
rats displaced striking behavior from white 
rats onto white dolls when the former became 
unavailable. Buss referred to this process as 
stimulus displacement and ascribed it, as did 
Miller, to a stimulus-generalization gradient. 

It should be noted that the two models are 
not clear, nor do they account for a great 
majority of aggressive behavior directed 
against seemingly nonfrustrating targets. The 
often discussed phenomenon of “scapegoating” 
seems to follow from the model of displace- 
ment of target, but makes no prediction 
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about which of the innumerable safe targets, 
including newly borns in perambulators, will 
finally be selected. Berkowitz (1962, p. 153) 
emphasized both safety and ethical aspects 
in the selection of a scapegoat, but the cri- 
terion of safety is directly opposed to that 
proposed by White and Lippitt (1960), who 
emphasized the self-esteem restoring aspects 
of aggression. Self-esteem, they argued, is 
restored only if a person can take some com- 
fort in his deed. The stimulus-generalization 
model is unapplicable, since no plausible a 
priori argument has yet been presented which 
establishes similarity between a frustrater 
and a scapegoat. Buss’ argument (1961, p. 
247) that scapegoating requires anger as a 
necessary condition was also contradicted by 
his own definitions, earlier established, be- 
cause it excluded the habitually hostile re- 
sponse of the prejudiced individual against 
the customary target, not necessarily in the 
presence of physiological arousal. Buss later 
(p. 258) did suggest that prejudice is most 
economically conceptualized as a learned 
habit, but failed to integrate this habit or 
response readiness with aggression. The argu- 
ment of social learning and imitation was 
further explored by Bandura and Walters 
(1963). Bindra’s (1959) formulation viewed 
displacement as the outcome of enduring re- 
sponse habits (similar, apparently, to Buss’s 
hostility) which can result in undiscriminat- 
ing aggression against a wide variety of tar- 
gets. Berkowitz (1962, p. 135) distinguished 
between both culturally learned hostility and 
culturally enforced, hostile behavior toward 
members of minority groups and the displace- 
ment upon a scapegoat arising from some 
immediate frustration. 

It is clear that both the target displacement 
and the stimulus displacement notions can 
easily become circular, A displaced target was 
attacked because safe and, having been at- 
tacked, must have been safe. An ultimate 
object of aggression is similar along some un- 
defined generalization gradient to an original 
annoyer and is attacked; conversely, having 
been attacked, it must have been somehow 
similar to him. Literature and one’s own ex- 
perience indicates that a disliked individual 
can, given proper provocation, acquire a sur- 
prising resemblance to a loathsome animal, 
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but the relationship of such an occasional 
metamorphosis to aggression is purely spe 
ulative. ; 
The experimental literature in some E 
stances supports a scapegoat hypothesis, ta 
is to say, selective punitivity against spect 
targets (Weatherley, 1961), in others ©” 
data indicate a general nonspecific height 
ing of aggressive tendencies (Berkow! er 
1959; Lesser, 1958; Lindzey, 1950). oie 
and Congdon’s (1955) data failed to SUPP 


Miller and Bugelski (1948) 
Landes, and Schaet (1959), 
cited in support of scapegoat theories, CAN" y 
conclusively be interpreted as indicating a 
selective heightening of aggressive are ers 
since attitudes towards the subjects Paie 
were not measured, which makes it cone ig 
able that antagonism against them Wê ted 
higher. Berkowitz (1960, 1961) pie! 
evidence that for highly prejudiced a on- 
discriminations between thwarting- a04 7 
thwarting-stimulus persons breaks 
Lesser (1958) also hypothesized that 
placement phenomena are largely 4° 
greater general extrapunitivity. T° ee 2 
aggression or aggressive tendencies ae pee? 
Presumably nonoffending target haye on 
viewed in terms of their observable oute 
but the understanding of the entire P from 
has suffered from efforts, originating syt- 
the emphasis placed upon the aggressiva nts 
come, to group widely different deter™ 
under the same arbitrary classification- 


dow™ 
dis- 


to 


Meanings of Catharsis 


The term “catharsis,” in relation to 
F-A hypothesis, was originally used t9 4 
an aggressive act followed by lowere“ — -cive 
quent aggression, This lowering of ager 
drive or, in terms of Miller’s (1941) pe ibe’ 
tion, of instigation to aggression was 
to a draining away of drive leading t° 28,60)» 
sion. Rosenbaum and de Charms Oea 
while studying the cathartic effects © 
ous aggression, found that the mere 4 sub 
tunity to respond to a frustrater lower? the 
sequent derogation of him, even wher on? 
original response was not an aggressiY' 4 aif 
(though this effect was present for 

esteem subjects only). 
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The issue of catharsis and the cathartic 
effect is further confused by conflicting ex- 
Perimental findings, Aggressive behavior has 
een found to lower subsequent aggression in 
some instances (Feshbach, 1955; Thibaut, 
1950; Thibaut & Coules, 1952) but in others 
as produced an increase in aggression (de 
Charms & Wilkins, 1963; Kenny, 1953). 
Feshbach himself (1956) obtained such find- 
ae in a later experiment and presented an 
: eresting analysis of the aggression-enhanc- 
ng or reducing effects of (aggressive) fantasy 
ore He hypothesized that aggression is 
tel artic where the subsequent situation is 
peoa to the situation involving the aggres- 
* © act, but that a second opportunity to 
E should lead to enhanced aggression 

cre the two situations are unrelated. 
sig 2gner (1944) argued that where aggres- 
cath 1s the outcome of habitual hostility no 
hse artic effect is to be expected; presumably, 
F g Sreater habit strength develops with 
er ated aggression, the strength of the lat- 
in, Should increase on those grounds. Accord- 
pa Buss (1961, p. 80 ff.), catharsis should 
is ase Subsequent aggression where anger 
resent but increase it in the absence of 
Cr because in the latter case inhibition has 

Owered, 
Stron, Charms and Wilkins (1963) presented 
Render evidence that hostile statements en- 
at the Subsequent heightened aggression and 

orch “Ame time clarified an issue raised by 
teduci el (1960) dealing with the instigation- 

org 8 effects of status restoration. In 
stap PPs experiment, the attacker and the 
whieh řestorer were one and the same person, 

made it impossible to separate the 
Fe Status restoration from those of at- 
teston, S changed image as a supportive status 
Char ĉr. From the Rosenbaum and de 
that oo (1960) results it might be inferred 
the atts equalization, that is, reduction of 
tute tacker’s status, may serve as a substi- 
Pesh Or victim’s status restoration (see also 
(19g3s°% 1964). De Charms and Wilkins 
‘Ubje, ) found that status restoration by the 
ater himself or by a vicar was positively 

‘pati to subsequent aggression, rather than 

Nsg as had been found by Worchel 
ie so Again, one is tempted to infer that 

Scial Connotations of retaliation against a 


angi 
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(deviant) attacker outweighed emotional as- 
pects of accumulated anger or anxiety. 

The evidence is thus truly conflicting. Ag- 
gression enhances or reduces subsequent ag- 
gression; a vicar’s aggression may serve either 
as a substitute (Rosenbaum & de Charms, 
1960), or as a model (de Charms & Wil- 
kins, 1963) for the subject’s subsequent ag- 
gression. However, the difficulties inherent in 
the interpretation of catharsis phenomena do 
not end here. After all, the cathartic effect 
may be due to any of the following reasons: 
(a) the subject has “let off steam” (an un- 
differentiated autonomic arousal model), (b) 
the subject has aggressed against someone or 
other (a hydraulic aggression model), (c) the 
subject has successfully aggressed against an 
indeterminate target (an expressive model), 
(d) the subject has successfully ageressed 
against the instigator (a self-esteem restora- 
tion or threat removal model), (e) the sub- 
ject has seen the frustrater punished, even by 
someone else, and cognitive expectations re- 
garding justice have been confirmed (a dis- 
sonance reduction model), (f) the subject has 
accumulated reactive inhibition or fatigue (a 
behavioral oscillation model), (g) the subject 
has developed guilt or aggression anxiety 
subsequent to aggression (an aggression- 
anxiety-with-delay model), (%) the subject 
perceives that a given amount of aggression 
is appropriate under the circumstances (a 
social reality model). 

Where a cathartic effect is ostensibly ab- 
sent, it may be the case that his aggressive 
habits have become stronger, that he perceives 
greater social desirability or tolerance for ag- 
gression, or that he perceives aggression to be 
the most effective way to deal with the situa- 
tion, etc. 

A series of experiments by Hokanson 
(1961) and his collaborators allows for inter- 
esting inferences regarding the role of expec- 
tations in determining the need for catharsis 
in order to produce the cathartic effect. Ho- 
kanson and Burgess (1962), for example, 
found that aggression against a high-status 
attacker did not reduce sympathetic arousal, 
while aggression against a low-status attacker 
did. These results appear to be directly rele- 
vant to the above mentioned variable of per- 
ceived or expected effectiveness of the attack, 
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Hokanson and Shetler (1963) found that 
when the subject is attacked by a high-status 
individual and does not expect an opportun- 
ity for retaliation his systolic blood pressure 
subsides at the same rate as when he was able 
to aggress. Though the authors discussed these 
results mainly in terms of substitute responses 
to frustration, such as withdrawal, an inter- 
pretation along the lines of perceived and— 
in the case of the high-status frustrater—ac- 
cepted social expectation appears quite plausi- 
ble. 

In any event, Berkowitz’ (1962, p. 196 ff.) 
exception to the absence of clear-cut inter- 
pretations to current experiments was well 
taken; too many variables have remained 
uncontrolled for any verdict regarding the 
notion of catharsis to remain undisputed or 
even to be meaningful. 

To sum up, the study of the form of social 
interaction termed aggression has been ham- 
pered by two major conceptual difficulties: 

1. The standard terms originating in psy- 
choanalytic theory and introduced to experi- 
mental psychology by the F-A hypothesis 
have been subjected to widely diverging defi- 
nitions, some of which have resulted in at- 
tempts at defining what are obviously hypo- 
thetical constructs solely in behavioral terms. 


2. The classification of superficially similar 


behaviors as one phenotype, disregarding or 
laboriously explaining different 


genotypes, 
. . = 

which, were it not for a priori categories, 
might have proved far more amenable to ex- 


perimental analysis. 
One need not object to the use of such 
terms as aggression, displacement, catharsis 
etc., as long as it is clear that these terms im- 
ply more than directly observable events. The 
admission of hypothetical constructs in the 
study of such complex social interactions as 
aggressive behavior need in no way detract 
from the rigor of empirical inquiry, 
to pursue such a study systematically, de- 
terminants other than those inherent in the 
stimulus itself must be investigated. While 
present assessments of habitual hostility as a 
personality variable may not allow for un- 
equivocal predictions of aggression, there can 
be no doubt that personality variables do 
play a part in determining aggression, and 
interactions between the stimulus and such 
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dispositional variables are probably gomg s 
afford even greater predictive accuracy iie 
personality variables alone. The situation? 
determinants facilitating or inhibiting ar 
sion and those affecting the number of alie 
native nonaggressive responses are pan 
obvious in their relevance. Finally, an ara i 
sis which ignores the aggressor’s expecta ot 
regarding the effectiveness of his projected ple 
would probably ignore an important pr 
affecting the likelihood of any, not jus 
aggressive, behavioral act. e of 
Keeping in mind the expected outcom 
an aggressive act, we might then W'S 
investigate the dimensions along woe 
stimulus other than the original target r an 
vary in order to present to the ageress in 
acceptable alternative target. We kee de- 
vestigate whether the other traditions ore 
terminant of displacement, that of & of 
threshold, might be more economical ¥ of 
plainable as a reduction in the nu™ js a 
Perceived response alternatives. Cathet ful 
its effects can be studied in terms of t? 
fillment of expectations regarding th “py. 
pleted act, and anticipation concerning 5 by 
Sequent aggression, Finally, as note a i 
Feshbach (1964), there is a definite ™ onal 


can 
a precise analysis of the roles of €" | jou 


€ v 
arousal in aggression, in terms of the 


possibilities discussed earlier. 


jvt 
ssl 
js 7 4 gr 
Summarizing Current Views On equen 


Behavior as a Stimulus-Response si j? 
Feshbach (1964) presented a fairly plow 


flow chart for aggressive behavior- E! nof 
Chart for Aggressive Response SU88 ides 
this model can be amplified by oor erib? 
the role of additional variables i sihe” 
earlier, Choice Point 1 denotes the zi iph 
tion of a stimulus as obstructing, ¢ rsi F 
habitually noxious—all defined as 4Y? imt” 
as not aversive, Where objective cl i 
conditions are constant, the habit o! ets in 
ing a member of certain stimulus subs w 
the subjective subset of “noxious stimi oi 
e viewed as being equivalent to th¢ ei gg 


r 
syndrome, while the likelihood of Peery 
a stimulus as threatening oF 
might be seen as related to, respec : 
esteem, and the ability to perce! 
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CHOICE 
POINT 


determinants: optimism, self-esteem, hostility, transient states of perceptual orientation. 
te: 


ation of aggressive or nonaggressive response (fractional anticipatory goal response) 
sive habit. 


i e ei a ssive naggressive responsi 
establisp 6 State: (a) Arousal may be classically conditioned to either aggressive or nonaggı ponse and becomes 


du ished 
Coreg" by Cognitive mediation; 


i vi it ay be labeled anger, (b) Arousal may be in- 
as an operant energizer. In the case of ag ressive response, it may 
itive mediation: the subject talks himself into being angry. 


i i reak e: y turn to CP 2 
Distosit; 3—-Continuation of aggressive response, weakened by amount > 0, or rei o C 
CHor tional determinants: conditional inhibition of aggressive responses (e.g, aggression anxiety). 


tite 
rip SOUS or alternative goal paths to the 
ginal goal 


Not a role of autonomic arousal in general, 
Consi 


da. 
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Sessarily labeled as anger, at this point 
S simply in eliminating alternatives of 
i Cation. More important than the auto- 
tary © State appears to be a person’s momen- 
fageghtCceptual orientation at the time he is 
ih With the stimulus. He may be anticipat- 
Noxious, threatening, or obstructing 
las ip and thereby increase the likelihood of 
atoy Og them accordingly. The effect of 
Wili 2t this point is, then, that of a prob- 
doq c Operator, which changes the likeli- 
Črta; ât a stimulus will be classified in a 
n way, 
t ga tice Point 2 represents the initiation of 
“tego “°SPOnse, It is assumed that a stimulus 


| Orized as not aversive has an initial 
\ 


4—Continuation and completion of aggressive or nonaggressive response or return to CP 1 or CP 2 
determinants: social and other reinforcement cues. 


probability of zero of evoking an aggressive 
fractional anticipating goal response, while 
noxious, obstructing, or threatening stimuli 
all have equal probabilities greater than (or 
equal to) zero of initiating an aggressive re- 
sponse. It is at this point that the construct 
“aggressive tendency” or “aggressive habit” 
becomes relevant. It plays the role of a proba- 
bilistic operator and, for reasons of economy, 
may be viewed as identical across classifica- 
tions though, of course, differing across sub- 
jects. , 

Stimulus classification (at Choice Point 
1), affected by both dispositional and situa- 
tional determinants, can be conceptualized as 
standing in a multiplicative relationship to 
aggressive habit. Where a not aversive classi- 
fication has been made or where aggressive 
habit is extremely low, the likelihood of an 
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aggressive response being initiated becomes 
very small. 

At Choice Point 1, anger may be present 
for two of the reasons discussed earlier. It 
may be an initially classically conditioned, 
later operant-arousal state associated with a 
response choice and subsequently labeled 
anger (see Shapiro, Crider, & Tursky, 1964), 
or it may be the result of deliberate self- 
stimulation, reinforced through previous, suc- 
cessful response activation; that is, the per- 
son talks himself into a rage (see Brehm, 
Back, & Bogdonoff, 1964). The role of auto- 
nomic arousal here is thus seen as simply 
energizing rather than as affecting the likeli- 
hood of a choice. 

Choice Point 3 deals with the dispositional 
inhibitory variables such as aggression anxi- 
ety. Such aggression anxiety is seen as a 
conditioned inhibitory response which may 
either, in the traditional Hullian sense, de- 
tract from the reaction potential of the on- 
going aggressive response, or, through a feed- 
back loop, result in the selection of an alter- 
native response. (The selection of an alterna- 
tive response can also be accounted for in the 
traditional paradigm by referring to response 
hierarchies, where a new response becomes 
dominant if another one is inhibited. How- 
ever, the notion of feedback and cognitive re- 
evaluation of an ongoing response is a useful 
one in this instance.) 

f Choice Point 4 refers to the situational 
including the social, determinants of the situ- 
ation. The functions of these determinants 
may be viewed as permitting the person to 
mediate anticipated rewards or punishments 
which appear to be likely outcomes of his 
present ongoing response. The effect of these 
determinants is either to “clear” the ongoin, 
response for completion, or to bring about : 
return to Choice Point 2 for selection of an 
alternate response. This process may take 
place where the original ongoing response is 
aggressive, that is, the person perceives the 
inappropriateness of his response and modi- 
fies it, or it may in some instances lead to the 
initiation of an aggressive response, even 
though the original response was nonaggres- 
sive. In the case where a stimulus had been 
classified as not aversive at Choice Point 1 
and initiated a nonaggressive response, we 
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should have the equivalent of attack ager 
a target for purely social or similar Tew 
expectations. It is important to note ture 
repeated social aggression of this ne E 
through classical conditioning may sonaa 
to a greater probability that a set afsh t 1. 
will be classified as noxious at Choice ror 
Choice Point 4 may also result in @ A o 
to Choice Point 1, that is, a reevaluay ation 
the stimulus itself. Again, this pe ini- 
may take place by classifying a stimu vg the 
tially perceived as not aversive into one o 
three aversive categories, or vice verse e te 
Once the aggressive or nonageress acts 
sponse has been completed, the su Be 
evaluation of the outcome may r auto 
to take place. It is at this late stage tha dure 
nomic arousal or anger may occur OF 
as a dissonance or anxiety reduce". „p she 
The present classification suggests : tim 


dispositional determinants of perce! ma sand 
uli in a certain manner (Choice eae 2) 
strength of aggressive habit (Choice ut such 
should be susceptible to retraining, Jeng 
a retraining process could be quite <, may 
Building up inhibitions at Choice PO”, 
be a questionable strategy, since * ssi A 
less aggressive behavior, where ag8"®™ | 
sponse tendency is held constant, 
greater conflict. The opportunities 
sented by Choice Point 4 consist ° 
the likelihood of a favorable S0C% ply: & 
payoff for aggression lower, OF) Sper i 
raising the expectation of a favorab genet 
for a nonaggressive response. * 
approach here has been either to 1” 
evaluation of the stimulus, discuss". 
tore (1952), Pepitone (1958), and ons? wee 
(1964), or to encourage a new re ie of 
stimulus which need not necessarily bei 
ceived as Jes jectionable ; 
(Kaufmann & Ss 1963a, 19630 wt 
a modification of an ongoing tes? 
but need not, be “frustrating,” 7 
Perceived as a threat or a Pa 
As the second Kaufmann a” 
(1963b) study indicated, the %5 
factor may be whether a person ae, 
aggression because he fears PY gat” 
guilt, or because the rewards of aog 
promise to exceed those of agate 
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MEASUREMENT OF STIMULUS CONTROL DURING 
DISCRIMINATIVE OPERANT CONDITIONING? 


HERBERT M. JENKINS 


McMaster University, Ontario 


The procedures in common use in discriminative operant conditioning are 
examined. It is concluded that they do not provide useful measures of stimulus 
control during acquisition because they confound the discriminative stimuli 
with antecedent reinforcement-nonreinforcement. Procedures which avoid con- 
founding and provide measures of stimulus control during an uninterrupted 
acquisition process are described. 1 such procedure is applied in an experi- 
ment on sequential effects in discrimination learning. Substantial effects of 
antecedent reinforcement-nonreinforcement on generalized response probability 
and on the difference in probability of response to the discriminative stimuli 
are shown. The existence of effects of this type indicates that confounding 
can introduce serious biases into the measures of stimulus control in discrimina- 


tive operant conditioning. 


It is widely recognized that discriminative 
Conditioning by the method of successive 
Presentations occupies a central position in 
the analysis of operant behavior. It is there- 
Ore surprising that this form of discrimina- 
tion learning has not been subjected to a 
More intensive experimental attack within the 
°Perant conditioning framework. In recent 
Years, the technique of generalization testing 
developed by Guttman and Kalish (1956) 

as been used to gather a considerable har- 
Vest of new information on stimulus control 
aS an end product of various conditioning 
Tegimes, Our knowledge of the development 
Of stimulus control during conditioning, on 
the other hand, has not gone forward at a 
Comparable pace. 

The long-standing neglect of discrimina- 
tion learning in operant conditioning may be 
due to an incompatibility between the pro- 
Cedures of operant conditioning, as they have 
evolved, and the conditions which must be 
met if stimulus control is to be measured 
during learning. Stimulus control is measured 
in terms of a difference in behavior attributa- 

le to experimental stimuli. Responding in 
the presence of a stimulus is not, in itself, 
“idence of stimulus control. Control is a 
Ynonym for covariation. For example, a gen- 
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eralization gradient shows stimulus control to 
the extent that some measure of the response 
varies with stimulus values (cf. Jenkins, 
1965). In order to obtain an interpretable 
measure of stimulus control, one not con- 
founded with other effects, it is necessary 
to measure behavior under conditions which 
differ only with respect to the stimuli in 
question. 

Now, the thesis of the present paper is, in 
outline, as follows. The conditions in common 
use in operant studies do not make it pos- 
sible to obtain the required measures of 
stimulus control during an uninterrupted 
process of acquisition. This is so because the 
procedures allow the effects of antecedent 
reinforcement and nonreinforcement on re- 
sponse probability to be confounded with 
the effects of the discriminative stimuli on 
response probability. The tactics which have 
been adopted to minimize or control for con- 
founding have had bad side effects. These 
tactics lead to procedures which are, to put 
it moderately, awkward for analytical pur- 
poses. By modifying the procedures in certain 
ways, confounding can be avoided entirely; 
not just minimized or controlled for. 

This argument is supported by an experi- 
ment in which some of the proposed modifica- 
tions in procedure are applied. The specific 
purpose of the experiment is to trace the 
effect of antecedent reinforcement and non- 
reinforcement on stimulus control during 
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learning. The results lend empirical support 
to the arguments about confounding since 
they show that effects of antecedent rein- 
forcement and nonreinforcement, which 
would confound the measurement of stimulus 
control in typical operant procedures, do 
indeed exist in considerable strength. 


How CONFOUNDING ARISES 


Confounding arises whenever presentations 
of the discriminative stimuli are correlated 
with the context provided by prior reinforce- 
ment-nonreinforcement. Consider a hypo- 
thetical example in which the correlation is 
made obvious. Suppose one arranged 3-minute 
presentations of S; and Sọ and reinforced 
every response in S;, and no responses in So. 
In effect, behavior in S, would be sampled 
repeatedly during the 3-minute Si presenta- 
tion, and, with the exception of the interval 
between the onset of the stimulus and the 
first response, the samples would all be in the 
context of antecedent reinforcement. Re- 
sponses in the S, period, on the other hand, 
would be sampled repeatedly and almost ex- 
clusively in the context of nonreinforcement, 
Therefore, context and stimulus Presentation 
would be highly correlated, 


Antecedent reinforcement-nonreinforcement 


one thing, there are 
reinforcement and no 
forcement in S, increases 
response both to S; an 
Se. Similarly, nonrei 


positive (S1) and 


tions. When presentations of Sı and Sy are 


or reinforce- 
neralized ef- 
ed with the 


stimuli on 
responding. 


A correlation between antecedent reinforce- 
ment-nonreinforcement and sti 
tations introduces the possibility of another 
kind of effect, Reinforcement-nonreinforce- 
ment may develop a cue function, The basis 


mulus presen- 
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for a cue function is evident in SA wA l 
thetical example. One reiiforcewene w, just 
predicts another reinforcement to fo dicts an- 
as one nonreinforcement in S predi impot- 
other nonreinforcement to follow. It < can 
tant to note, however, that casa ne 
occur even though no cue function eneral- 
If there is some effect of context ante an 
ized response probability and if satel con- 
stimulus presentation are eager made 
founding is present. The cue ¢ t another 
possible by the correlation is Jus response 
way in which context can altel asureme! ; 
probability and confound the me | 
of stimulus control. 

The use of regular reinforcen™ and 0 
example makes the problem ono dif- 
one would consider the perior o be 
ferential between Sı and Se perio cb ad ar- 
measure of stimulus control in SU 
rangement, However, the problem 
although it becomes less OO iils 
reinforcement to S, is intermit erant ©” 
generally is in discriminative OP 
ditioning, 3 ocedt 

ily ne operant conditioning Foa more 
with maintained stimulus present: a re 
likely to introduce a correlatio the 
context and stimulus presentation way, he 
the procedures used with the ice one 
jumping stand, and related device® 
reason is that in these other e 

` asures ©, h th! 
does not continue to take me tW 
ples of behavior after the point 4 
occurrence of reinforcement 4” 
forcement differentiates positive 
tive trials. Successive samples & 
separate trials, Since it is both ea 
Customary to arrange a rando uli nt” 
of trials, the confounding of 


me 
+ force 
antecedent reinforcement-nonre!™ 


nt in our 


: very of 
a source of confounding. en ne © 
cedure is obviously not suited 
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ous measurement of the development of 
stimulus control because it interrupts condi- 
tioning by extinction in order to obtain a 
Measurement. Another tactic is to use a con- 
trol group in which only reinforcement dif- 
ferentiates Positive from negative presenta- 
tions (Frick, 1948: Smith & Hoy, 1954). 
Owever, this results in an indirect measure 
of stimulus control which rests on a between- 
Stoup comparison of performance with and 
Without the stimuli rather than on a response 
ifferential within an animal. The objection 
Seems less serious if one finds that the rein- 
Orcement schedule alone produces no signifi- 
Cant performance differential between posi- 
tive and negative periods, but even in this 
fortunate circumstance the possibility that 
© reinforcement schedule introduces a bias 
When it interacts with the discriminative 
Stimuli cannot be ruled out. Finally, a tactic 
80 widely used that it may be considered a 
characteristic of discriminative operant con- 
tioning is the use of intermittent rather 
than regular reinforcement. This tactic no 
Subt succeeds in reducing the magnitiude 
of Confounding since it reduces the correla- 
‘on between antecedent reinforcement-non- 
Teinforcement and stimulus presentations. 
Ut, it does this at the cost of also reducing 
e correlation between the discriminative 
Stimuli and reinforcement. The latter correla- 
tion, which is perfect under regular reinforce- 
Ment, is an important and interesting vari- 
able in itself, A procedure in which the full 
Tange of the variable cannot be explored 
Without giving up the possibility of a con- 
tinuous measure of stimulus control during 
acquisition is seriously limited. 


Ways to Avo ConFouNDING 


In order to see how confounding may be 
avoided, it is necessary to examine in more 
detail the relation between context and 
discriminative stimuli both within stimu- 
US presentations, and in the sequence of 
Tesentations. 


(el ; ae . 
"founding within Presentations 


b Two ways of relating reinforcement to 
siti i 


are qve and negative stimulus presentations 
Shown in Figure 1. In Case A, Sı and 


J. 
; 7 
COMPARISON COMPARISON 
IN VAL INTERVAL 
A B 


Fic. 1. Two ways of programing reinforcement 
(shown by vertical arrow) in relation to stimulus 
presentations. (In A, Sı and Sə trials are of fixed 
duration, In B, the S, trial terminates at the first 
reinforcement while the Sə trial is of fixed duration, 
Within the comparison interval, performance is not 
confounded by reinforcement or nonreinforcement.) 


Se are of equal duration. On some or all S, 
trials, one or more reinforcements occur 
according to some schedule. The interval 
from trial onset to the first reinforcement 
(vertical arrow in Fig. 1) is labeled as the 
comparison interval, since any S,—S» per- 
formance differential measured within this 
interval can only be attributed to the stimuli. 
Beyond this interval there is confounding, 
since performance must be regarded as a 
joint function of stimuli and antecedent re- 
inforcement. Confounding from this source is 
very common in practice (e.g, Bower & 
Grusec, 1964; Dinsmoor, 1950; Frick, 1948; 
Pierrel & Sherman, 1962; Smith & Hoy, 
1954). Commonly used schedules of rein- 
forcement in S, also introduce the likelihood 
that reinforcement will serve a cue function. 
Reinforcement early in the period signals an 
increased probability of reinforcement in the 
remainder of the period. Consider, for ex- 
ample, a case in which Sı and Sə presenta- 
tions are of equal duration and appear 
equally often, On S, trials, zero, one, or two 
reinforcements are programed with equal fre- 
quency; no reinforcements occur in Sə. For 
ease of calculation, assume the period of pres- 
entation to be divided into halves which con- 
tain, at most, one reinforcement, Now, ignor- 
ing the stimulus differences, the probability 
of reinforcement in the second half given 
none in the first half will be 1:9, However 
when a reinforcement has occurred in the 
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first half, the probability of reinforcement in 
the second half jumps to 2:3. Thus, there 
exists a basis for the development of a 
discriminative or cue effect of reinforcement. 

In Case B of Figure 1, the S, trial is termi- 
nated by reinforcement, but the Sə trial is of 
fixed duration and extends beyond the com- 
parison interval. The arrangement is no less 
confounded in principle than is Case A.A 
period of nonreinforcement exceeding a cer- 
tain duration (when reinforcement is on an 
interval schedule) or continuing after the 
completion of a certain number of responses 
(when reinforcement is on a ratio schedule), 
occurs only on Sə trials. The early experi- 
ments on discriminative conditioning by 
Skinner (1938, pp. 183-192) were on this 
plan. Another arrangement which would lead 
to Sə durations in excess of S; is one in which 
the termination of the Sə period is made 
contingent upon a period of no responding, 

Confounding within trials may be avoided 
simply by terminating Sı and Sə according 
to the same rule or contingency, When this 
is done, all measurements will be taken 
within a comparison interval; that is, before 
the presentations are differentiated by the 
reinforcement contingency, For example, if S, 
is terminated in reinforcement by the com- 
pletion of a certain number of responses 
(ratio contingency), then So must also be 
terminated at the completion of this response 
requirement. Variable contingencies of the 
interval type can be used as well. Further, 
the probability that S, terminates in rein- 
forcement when the contingency is met 
can be varied. Since the contingency will 
not always be met, particularly on S, trials, 
it Is necessary that both S, and Sə trials 
have some maximum allowable duration 
after which they are terminated by external 
control. 

An arrangement which is ideal for the 
purposes of measurement may not be ideal 
for training a discrimination quickly. In par- 
ticular, in order to avoid some form of 
secondary reinforcement from the termina- 
tion of the trial, or just to allow for heavier 
doses of nonreinforcement, one might wish to 
maintain Sy beyond the point at which the 
reinforcement contingency is met. Fortu- 


nately, it is a simple matter to avoid con- 
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founding even though trials are not a 
terminated after the comparison ensite: 
The requirement is only that the m al be 
ments taken in the comparison mie that 
separated from those taken beyon “foul 
interval. The control circuit which rison 
have terminated trials after the ng re- 
interval can be used to transfer n en 
cording of data from one register that re- 
other, or simply to stop recording s remain 
sults for the comparison interva 
uncontaminated. 


Confounding in the Sequence 


i ich pos 

It is clear that a procedure 1n wh rations 
tive and negative periods of fixe en r 
are alternated (e.g., Bower & Gruse® 


i 
ds 

c 4 onfoun 
Dinsmoor, 1950; Frick, 1948) T f the 
discriminative stimuli with the eve sen- 


: re 

immediately preceding presentati ` will be 
tations of the positive stimulus, ement 3 
regularly preceded by nonreinforc® y only 
the negative stimulus, Se. Conver iy te 
Se presentations will be immedia p ië 
ceded by reinforced responding HR in per 
sequence of this kind, a differenc®. the 


t rel 
carry-over effects of anteceden 


t 
also develop. asuret, 
In order not to confound the Ten ior 
of stimulus control, the context A G 
ceding presentations must be t 
presentations of S, and Se- 
independence of context is rea rete 
when a random sequence of mee can 
Se trials is used, The requireme” | + is, 
be met in a continuous procedure: 
in which no third condition, suc” d 7? 


2 ref 
from the apparatus, separates SE n of Fol 
simply arranges a random ae wt A 
entations and programs the oe sti at 
gaps. In a sequence of this oct E ich gt 
remains unchanged for intervals we 


r tet 
in duration in the same way rials ve 7 
of runs of positive and negative ial : Poe 
a random sequence of discrete n of du" qv" 
ingly, the frequency distributio ‘im us 
would be geometric. Variable $ 

tions were used by Pierrel 


Vi 
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(1962), but the distribution of durations was 
not geometric, 
ven though a continuous procedure can 
2¢ arranged to give contextual independence, 
it has Some undesirable properties which can 
ca in discrete trial procedures. In the 
is ee Procedure, any change in stimulus 
Ompanied by a change from a reinforced 


to ; : p 
T a nonreinforced period, or vice versa. 
erefore, the change of stimulus, rather 


Se or in addition to distinctive features 
maintained stimuli, may underlie a per- 
ormance differential. Further, responses in 
© negative period receive secondary rein- 
°tcement at the transition to the positive 
Period. The secondary reinforcement would 
© expected to slow the rate of reduction in 
2 responding, and perhaps also to raise the 
asymptotic level of Sy responding. Breaking 
e stimulus sequence into separate periods 
avoids these difficulties and has the further 
analytical advantage of allowing experimental 
Control over the temporal spacing of presenta- 
tions, For these reasons, the discussion which 
Ollows is primarily concerned with discrete 
trial procedures, 
here are several ways to separate stimulus 
Presentations. The manipulandum may be 
Tetracted. A blackout may be used when the 
S are pigeons, since darkness naturally in- 
duces inactivity in birds. An alternative is to 
Separate trials by a discriminative stimulus 
Clearly different from the trial stimuli 
(Jenkins, 1961, 1962), In the present experi- 
ment, for example, the pigeon’s compartment 
Temained illuminated at all times, but the 
Tesponse key was lighted only on the trials. 
Intertrial responses to the unlighted key, 
which occur at the outset of training, were 
quickly eliminated through nonreinforcement, 
The separation of trials in this way avoids 
the radical changes in the light-adaptation 
level of the pigeon’s eye which blackouts 
Cause. The method also ensures that re- 
Sponses are made to the trial stimulus, and 
Not simply because a physical constraint or 
a stimulus having strong unconditioned ef- 
ects has been removed. Finally, it may be 
Noted that the use of a discriminative stimu- 
MS to separate trials makes operant and 
assicay conditioning procedures more alike. 
© salivary response in classical condition- 
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ing, for example, is not physically blocked 
between trials. Rather, a discrimination be- 
tween background alone, and background 
plus CS, keeps between-trial salivary flow: at 
a minimum. An arrangement of this kind 
may be referred to as a discriminated trial 
procedure to distinguish it from what might 
be called a gated trial procedure in which 
the response is physically prevented between 
trials. 

When discrete trials are employed, inde- 
pendence of context is easily ensured by the 
use of an entirely random sequence of Sı 
and Sə trials. A sequence of this type may, 
however, be inappropriate for the purposes 
of a particular experiment. One might re- 
quire, for example, that the lengths of runs 
of trials of one type can be kept to a mini- 
mum. The Gellermann series (Gellermann, 
1933) have a maximum run length of three 
trials of the same type and are sometimes 
used to determine the sequence of positive 
and negative periods in successive discrimina- 
tions (e.g., Mabry, 1965: Pierrel & Sherman, 
1962). These series were originally devised 
to avoid correct choices on the basis of posi- 
tion or alternation habits in simultaneous 
discrimination problems. Although they may 
be suitable for that purpose, their use in suc- 
cessive discriminations is unfortunate. The 
potential for confounding is evident, since 
only S; trials will appear after three consecu- 
tive nonreinforcements and only Sə trials 
after three consecutive reinforcements, It 
should be noted that the animal does not 
have to “learn the series” in order for its 
performance to be dependent on the events 
of prior trials. Discriminative effects of re- 
inforcement and nonreinforcement may of 
course develop, but even if they do not, the 
generalized effect of prior reinforcement or 
nonreinforcement on response probability 
would be a source of confounding. 

It is of interest that sequences with special 
features may be contrived which, although 
highly nonrandom overall, yield unbiased 
measures of stimulus control at certain 
points. The sequence shown in Figure 2 is an 
example. The trial sequence generated by this 
schema, like the Gellerman series, has a maxi- 
mum run length of three trials of the same 
type. There is, in fact, a regular alternation 
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TRIAL NUMBERS 
i in which odd- 
. Schema of a trial sequence on } 

a ae trials alternate regularly between = 
ae hile even-numbered trials provide pees 
a es of stimulus control, (Sı is presented wit 
a pability P; S: with probability 1-P. The last 
er shown dotted, is the same as the first.) 
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ositive and negative stimuli on odd- 
i trials, and the sequence, taken as 
a whole, is confounded. The context given by 
the immediately preceding trial is equivalent 
for Sı and Se, but the context is not equiva- 
lent when two preceding trials are considered. 
An So trial always occurs two trials in back 
of the odd-numbered S; trial, and, conversely, 
an Sı trial always occurs two trials in back 
of the odd-numbered Sə trial. On the other 
hand, on even-numbered trials, where the 
branches occur, S; and Se presentations are 
independent of context, Thus, by separating 
performance measures taken on even-num- 
bered trials from the rest, unconfounded 
measures are obtained. The S, and S2 per- 
formance differential on even-numbered trials 
following S; (Trials 2, 6, 10, etc.) might be 
systematically different from the differential 
on even-numbered trials following Sə (Trials 
4, 8, 12, etc.). That would indicate an effect 
of context on stimulus control. Whether or 
not the context has an effect, however, data 
from all even-numbered trials may be pooled 
without confounding as long as the probabili- 
ties of Sı and So trials following S; trials 
are the same as the probabilities following S. 
trials. In that case, there is no contingency 
between the context provided by the fixed 
trials and the discriminative stimuli on the 
branches. The effect of different contexts 
therefore applies equally to S, and So. The 
rule is that data taken from locations in a 
sequence at which antecedents are system- 
atically different may be pooled without 
confounding as long as there is no contin- 
gency between antecedents and stimulus 
presentations, 
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errors cause a repetition of the tri onrandom 
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Procedure 


Subjects. The Ss were 6 male White Carneaux 
Pigeons 5-6 years old, without prior experimental 
history, They were maintained at 80% of their free- 
feeding weight. 

Apparatus. An automatic key-pecking apparatus 
(Ferster & Skinner, 1957) was used. The compart- 
Ment was constantly illuminated by diffuse light 
from a source which extended across the top of the 
Tesponse Panel, Equipment sounds were masked by 
Waite noise from a speaker within the compartment. 

einforcement Was access for 3.5 sec. to a tray 
Containing mixed grain. 

The stimuli were line grids which were automati- 
cally positioned behind the transparent key and 
fansilluminated. The discrimination was based on 

e orientation of the grid. A vertical grid served 
aS the reinforced or Sı stimulus. The nonreinforced 
Stimulus, Se, was a horizontal grid, or in another 
Phase of the experiment, a grid rotated 15° from 
the vertical, 

Trials, A discriminated trial procedure was used. 
rials were initiated by lighting the key and were 
erminated (key-light off) by the first response or 
Y external control if no response was made within 
Sec, A response to Sı was immediately reinforced. 
uring the intertrial periods the key was dark, but 
os compartment remained illuminated. The inter- 

als between onsets of trials were equally often 15, 

0, or 45 sec. in random sequence. An intertrial 


Tesponse prevented the appearance of a trial within 
the next 60 sec. 


ti 
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Preliminary Training. The Ss were trained to 
respond to S: by the method of successive ap- 
proximation. After several reinforced responses had 
been made, intertrial intervals were introduced and 
the maximum trial duration was shortened to 5 sec. 
The development of a quick response to the lighted 
key and the virtual elimination of intertrial responses 
generally required one or two sessions. Beyond the 
initial sessions, every S received 6 sessions of train- 
ing, each consisting of 40 S, trials, before being 
exposed to Se. 

Trial Sequence. A type of correction trial pro- 
cedure was used. An equal number of Sı and S: 
trials was programed by the use of punched tape. 
Following correct trials (response to Sı or no re- 
sponse to Sz), the tape advanced so that the next 
trial was governed by the taped random sequence. 
On the other hand, following incorrect trials (no 
response to Sı or response to S:) the tape remained 
in position and the same stimulus appeared on the 
next trial. Trials presented after an advance of the 
tape are called random trials to distinguish them 
from repeat trials. 

Plan of Analysis. Random trials can only be pre- 
ceded by a correct Sı trial (S:R), which entails a 
reinforced response, or by a correct Sa trial (SsNoR), 
which entails no response and no reinforcement. 
Performance on random Sı and Sə trials following 
each type of correct trial and performance on repeat 
trials were separately analyzed so that the effect of 
these antecedents could be identified. The analysis 
keeps track of only the immediately preceding trial. 
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Fic. 3. Mean response probability by sessions as a function of stimulus and antecedent 
trial; easy discrimination. (Antecedent trial is shown after the slant line.) 
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However, when there is substantial generalization 
from Si, a correct Sə trial will commonly occur only 
at the end of a series of repeat S» trials. 

Two measures of performance were taken: the 
probability of response, and in the case of a re- 
sponse, its latency to the nearest tenth of a second, 

Discrimination Problems, After preliminary train- 
ing, 14 sessions were run on an easy discrimination, 
The Sı was set at 0° (vertical) and Se at 90° 
(horizontal). In each session, 40 S, and 40 S2 trials 
were programed. The Ss were then switched to a 
difficult discrimination: S, as before, Sa at 15°, Seven 
sessions, each consisting of 20 S, and 20 S2 programed 
trials, were then run, 

Results 


The probability of response to S, and Se 
following different antecedents is shown by ses- 
sions for the easy discrimination in Figure 3, 
Random S; and Ss trials are preceded by S,R 
(correct Sı trial) or by SsNoR (correct Sy 
trial), giving rise to four separate curves. The 
fifth curve is for repeat S trials; that is, 
an Sə preceded by SoR. Repeat Sı trials were 
infrequent and are omitted, Corresponding 
latency data are given in Figure 4. 


on 
t. a «als as resp 
encies become unavailable on Sz trials 


on data from four or five of the six Ss-) 

sal aP 
Early in training the antecedent a i 
pears to influence the response bo espons 
and to Sa. The mean probability © response 
was lower, and the mean latency of ollowin’ 
was longer, following SəNoR than 3 rials 
SiR. Response strength on repeat fol owint 
was above the level for Se trials qiffere” 5 
an SeNoR. The explanation of this „NOR 
probably lies in the fact that an ia 
typically occurs at the end of a run epe? e 
Sz trials. The average number of © ot 


associated with a deeper state ol f 
than is the average SəR trial. The isc 
antecedent trials diminish as the ining ce 
tion is perfected by continued T res yer 

Mean latency and probability © first "os 
based on combined results for z nd L 
sessions are given for random >i Table 
trials as a function of antecedent i” 
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Ee Paani trial significantly affected 
IS e probability and latency of response 
4 n The difference in probability (.682 

- 370) and the difference in latency (2.02 


ti 


se 
Be . 2.71 sec.) were each significant at 
eee, by two-tailed £ test. Although the 


ise, alas the antecedent trial on responsive- 
failed 4 1 Were in the same direction, they 
of i reach significance either in terms 
lates ability (.968 vs. .834, p<.2) or 
“i (1.69 sec. vs, 1.90 sec., p < .2). 
POA e interest is the fact that the 
tially Ive effect of an SsNoR was substan- 
subse Steater on a subsequent Sə than on a 
r$ quent Sı. In terms of probability, the 
ta os difference following S,R was .968 
83. £ or .286. Following SəNoR it was 
signin 370 or .464. These differentials are 
i Ped different from one another (.286 
a * 404, p< 01). Latency of response yields 
Parallel result, The Sı-S difference in 


l ; 
ag following S,R was 1.69 sec. — 2.02 
sé or .33 sec, Following SoNoR it was 1.90 


dif, 2.71 sec. or —.81 sec. Again, these 
ë erentials are significantly different (—.33 
ec. vs, —.81 sec., p < .02). 
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TABLE 1 


MEAN PROBABILITY OF RESPONSE, P, AND MEAN 
Latency, L, (in SEconps) ON Ranpom Sı AND S: 
TRIALS AS A FUNCTION OF ANTECEDENT TRIAL 


Antecedent trial | Stimulus | P | L 
SR | Š | 968 | 1.69 
SNR | Si ‘334 | 1.90 
SiR | $: | .682 2.02 
S:NoR | S | 3 271 

| 


Note.—First two sessions, easy discrimination. 


Response probabilities for the difficult dis- 
crimination are shown by sessions in Figure 
5. In contrast with the easy discrimination, 
little or no improvement takes place. Also, 
unlike the case of the easy discrimination, the 
probability of response on repeat Sə trials 
was indistinguishable from the probability of 
response to random Sə trials following 
S2NoR. However, the other major effects of 
antecedent trials, which were transient in the 
easy discrimination, were in this case main- 
tained throughout training. Mean latency and 
probability of response for seven sessions are 
shown in Table 2. The direction of differences 
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Fic. 5. Mean response probability by sessions as a function of stimulus and antecedent 
trial; difficult discrimination. (Antecedent trial is shown after the slant line.) 
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TABLE 2 


MEAN PROBABILITY OF RESPONSE, P, AND MEAN 
LATENCY oF RESPONSE, L, (IN SECONDS) ON RANDOM 
Sı AND Sz TRIALS AS A FUNCTION OF ANTECEDENT 


TRIAL 
Antecedent trial Stimulus P L 
SıR Sı -987 1.44 
S:NoR Sı 978 1.59 
SiR Se 866 1.66 
S2NoR S2 -144 1.92 


Note.—Based on all sessions, diffcult discrimination. 


parallels the results for the easy discrimina- 
tion. The antecedent trial had no significant 
effect on the probability of response to Sı 
(.987 vs. .978, £ <1), but did show a signifi- 
cant effect on latency (1.44 sec. vs. 1.59 sec., 
p< .05). The antecedent trials had a clear 
effect on the probability of response to Sa 
(.866 vs. .744, p< .01), and a detectable 
effect on latency (1.66 sec. vs. 1.92 sec., 
p < .05). In terms of probability, the SS. 
difference for an antecedent S,;R was .987 
—.866 or .121, while for an antecedent 
SoNoR it was .978 — .744 or .234. These 
differentials are significantly different from 
one another (.121 vs, .234, p< 01). The 
corresponding differentials for latency differ 
in the same direction but not significantly 
(—.22 sec. vs. —.33 sec., p< .2). 

In the early phase of the easy discrimina- 
tion and throughout the difficult discrimina- 
tion, the antecedent SəNoR leads to a lower 
mean response probability and a longer mean 
response latency than does the antecedent 
SiR. However, the depressive effect of SsNoR 
on responsiveness to Sı was relatively ‘small 


and often insignificant, while the depressive 


effect on Sz was more substantial. As a result 
? 


the discriminative stimuli exerted a greater 
degree of control over performance following 
SNOR than following S,R, 4 


Discussion 


Antecedent reinforcement raised 
bility of response to the stimuli oe 
and antecedent nonreinforcement lowered it 
Parallel effects on latency were observed. There 
was no sign of effects in the opposite direction 
although conceivably such effects could have 
arisen from frustration or by an inductive effect 
of the Pavlovian type (Pavlov, 1960, pp. 188- 
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203). The results of Verplanck (1942) ~~ 
with one exception, in accord with the presen 
results. The exception, it will be remembers e 
was that in his experiment the direction of A 
effect of antecedent reinforcement On Sal 
measure of performance (starting time) terr 
during training. At the end of training, E 
ing times were longer following 4 pee 
trial than following a nonreinforced trial. o 
reversal might have been due to the learnt i 
a discrimination based in part on the regu = i 
of the trial sequence which Verplan pk My the 
However, the same trial sequence withou 
discriminative stimuli did not produce 
reversal. Therefore, if regularity was indet ter- 
sponsible, it must have been regularity 1° "rom 
action with the differentiation of postive the 
negative trials by discriminative stimuli. ed no 
present experiment, antecedent trials provi 1, al 
cue to the nature of the next random trit it re- 
the direction of the effect of the anteceden in 
mained unchanged throughout extensive us 
These results have a clear bearing 0n ™ 
Even in the complete absence of any cue ects 
ties in the trial sequence, substantial efe he 
antecedent trials appear. If presentations o: with 
discriminative stimuli had been correlate pro- 
antecedents as they usually are in ot nine 
cedures for discriminative operant condit nave 
the resulting performance measures woul ts 
been a joint, confounded product of esten ant 
to both antecedent events and the disct m acted 
stimuli. Still larger effects would be {y 
from reinforcement and nonreinforcement th 
Presentations, since within presentation e iy 
would be no intertrial interval to atten’ the 
effects. Antecedent trials affected not Bee per 
level of performance but also the Si sub: 
formance differential, The differential WA 
Stantially greater following a negative 
it was following a positive trial. 
this result be interpreted? ts and 
The antecedents were compound even pelt 
the interpretation of how they exer 7 
effect must wait on experiments in y e A 
parts of the compound are separated. a T 
tion such experiments should answer 2 
what extent do the effects depend ©" sd non 
relation of antecedent reinforcement ante po 
reinforcement with Sı and Sọ? It is ae oe 
sible that the effects are not entirely, ml 
largely, dependent on an associative e tO g5 
specific process, They might have mo gnes te 
with changes in the level of respons! at a p 
such. One might find, for example © ont 1s 
cedent reinforcement or nonreinforet! me of 
an irrevelant stimulus would have si” 
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on the performance differential on the trial 


following. Perhaps just the opportunity to eat, 
even though not used as a reinforcement for a 
‘ey peck, would raise responsiveness on the 
Subsequent trial and reduce the performance 

erential. The existence of nonassociative ef- 
fects of this kind is suggested by two findings: 
sa) relative stimulus-generalization gradients 
ecome steeper as drive is lowered (Jenkins, 
Pascal, & Walker, 1958), and (b) relative gradi- 
ents also become steeper as the overall level of 
responsiveness decreases due to extinction in 
Successive Generalization tests (Friedman & Gutt- 
man, 1965; Jenkins & Harrison, 1960. 

e present approach can be related in a 
8eneral Way to Harlow’s error factor analysis 
of simultaneous-choice discrimination on the 

'sconsin General Test Apparatus (Harlow, 
1959), In both cases, the progress of discrimina- 
tion learning is viewed as a composite of local 
elects, Further, it is perhaps more than co- 
incidence that Harlow also finds better per- 
te ance after a nonreinforced trial than after a 
ae aforced trial. He identifies this error factor 
Sa tendency to shift responses (response shift 
aad from one object to another on successive 
3 als. Thus an error is more likely to follow 
t correct choice than to follow an incorrect 
5 Olce, His interpretation is specific to the 
multaneous-choice task, but perhaps it will be 
Ound that a general lowering of responsiveness 
aS a result of nonreinforcement plays a role both 


“eae and in simultaneous-discrimination 
asks, 


SUMMARY 


. Procedures in common use in discrimina- 
tive Operant conditioning do not make it pos- 
sible to obtain unbiased measures of stimulus 
control during an uninterrupted acquisition 
Process. The diffculty is that in operant 
procedures, presentations of discriminative 
stimuli are almost invariably correlated with 
antecedent reinforcement-nonreinforcement. 
The correlation confounds whatever effects 
may be due to the discriminative stimuli 
With two different effects of antecedent 
‘einforcement-nonreinforcement: (a) effects 
n the generalized strength of response and 
(è) cue effects, 
iy termittent reinforcement is widely used 
ingotder to minimize confounding. However, 
Vary mittency of reinforcement is, in itself, a 
Yeg tle which undoubtedly affects the de- 
ment of discrimination, A procedure 


375 


which cannot provide interpretable measures 
of stimulus control throughout the range of 
this variable from partial to regular reinforce- 
ment is seriously limited. 

One can avoid confounding within stimulus 
presentations by applying the same rule to 
the termination of positive and negative 
presentations. The rule or contingency which 
terminates positive trials and delivers rein- 
forcement with some probability also termi- 
nates negative trials, but without reinforce- 
ment. Confounding can also be avoided even 
though the presentations are not actually 
terminated at the point at which the con- 
tingency is met, As long as performance mea- 
sures which are taken up to that point are 
recorded separately from measures which are 
taken beyond that point, confounding will be 
avoided. 

Confounding due to antecedents in the 
Sequence of presentations can be avoided by 
the use of a random sequence of positive and 
negative stimuli. However, the sequence need 
not be random in its entirety. The require- 
ment is only that at some points in the 
sequence the discriminative stimuli be pre- 
sented at random and that the measurements 
taken at these points be separated from 
the measurements taken on the nonrandom 
portions of the sequence, 

In principle, confounding can be avoided 
just as readily when successive presentations 
appear without a gap as when separate pres- 
entations or trials are used. However, sepa- 
rate presentations have certain analytical 
advantages, One way to separate trials is to 
train a discrimination between trial and 
intertrial (discriminated trial procedure) 
rather than to physically prevent intertrial 
responding (gated trial procedure). 

An experiment in which the discriminated 
trial procedure was used showed clear effects 
of the antecedent trial on the probability and 
latency of response to the stimulus presented 
next. Responsiveness was increased by an 
antecedent positive trial and decreased by an 
antecedent negative trial, These effects were 
prominent early in training on an easy dis- 
crimination and were evident throughout 
training on a difficult discrimination, Sequen- 
tial effects of this sort produce confounding 
when antecedent reinforcement-nonreinforce.. 
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ment is correlated with the discriminative 
stimuli. 

The antecedent trial affects not only the 
level of performance but the size of the 
differential in performance which is attributa- 
ble to the discriminative stimuli. The differ- 
ential is larger following a nonreinforced trial 
than it is following a reinforced trial. 

The procedures which have been outlined 
for obtaining unbiased measures of stimulus 
control during acquisition are quite flexible. 
Presentations may be continuous or sepa- 
rated. Reinforcement may be regular or 
partial. Single or multiple responses may be 
required within each presentation. The con- 
tingency of reinforcement may be variable or 
fixed, and it may be based on time, response 
count, or any combination of these. Further, 
nonrandom sequences with any number of 
special constraints may be used as long as 
random presentations of the discriminative 
stimuli occur at some points in the sequence. 

In view of the analytical advantages 
and flexibility of procedures which provide 
measures of stimulus control which are not 
confounded, one would hope that such pro- 
cedures will become more popular in the 
study of discriminative operant conditioning, 
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DO COMPOUND EVOKED POTENTIALS REFLECT 
PSYCHOLOGICAL CODES?! 


WILLIAM R. UTTAL 


University of Michigan 


The 


purpose of this paper is to examine the question of the utility of the 
compound evoked potential as a sign of perceptual experience. 


A discussion is 


presented of the origin of the peripheral nerve compound action potential and 


the brain potential evoked by impulsive stimuli. 
an Ss is considered to determine what progress 
has been made in correlating these electrical signs with psycho 


on the potentials from hum: 


Some of the recent literature 


physical re- 


sponses. It is concluded that while certain suggestions have been forthcoming, 


both denying and affirming the re’ 


lation, the evidence is still inconclusive that 


simple correspondences exist between the 2. 


Often in our attention to the details of ex- 
Periment and communication, we tend to lose 
Sight of the greater goals which motivate the 
Particular research with which we are in- 
volved, For many of us in the field of physio- 
logical psychology, the main theme of our 
research has been a search for correlations 
between the functions of the brain and the 
Verbal discriminative responses emitted by a 
Subject in a psychophysiological experiment. 
So far, we have not progressed far towards 
this goal since not only do we lack adequate 
knowledge of both brain function and dis- 
criminative behavior, but there have been 
few experiments which have been specifically 
designed to compare the two types of data 
under identical stimulus conditions, 

Nevertheless, many of us have assumed 
that these relations exist and that it is con- 
ceptually possible and Practically probable 
by means of the currently available research 
methodology to establish them. It is, in addi- 
tion, true that many of us have gone beyond 
this general act of faith to a more specific 
ne, This more specific idea involves the as- 


x ‘The preparation of this paper and certain of the 

pacarch projects described in it were supported in 

tint by Research Grant MH 8786-01 from the Na- 

th nal Institute of Health and Grant GB 2000 from 
© National Science Foundation. 
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sumption that it is possible to establish such 
relationships between the compound evoked 
potentials elicited by impulsive stimuli and 
psychological phenomena. 

It is the purpose of this Paper to explore 
this hypothesis, drawing upon the data forth- 
coming from the psychological and the physio- 
logical laboratories rather than from the 
philosophical exercises which originally gave 
tise to the mind-brain question. 

One area in which there has been progress 
is in the comparison of electrophysiological 
and verbal responses to impulsive stimuli, 
such as clicks, flashes of light, and brief elec- 
trical shocks. Comparisons such as these are 
valid, since the stimulus conditions are well 
determined. Electrophysiological responses 
are time-locked with specific latencies follow- 
ing the stimulus presentation, and the psycho- 
physical responses are controlled with an ap- 
propriate experimental design to reflect only 
the transient effects of the stimulus. 

We will exclude from this discussion such 
other processes as intelligence, thought, and 
personality for which the stimulus conditions 
are less well understood. It is our opinion 
that these latter types of behavior will have 
to be contrasted with other Physiological 
measures which, in general, are not yet avail- 
able for laboratory investigation, However, it 
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would be inappropriate to imply that these 
functions are more (or less) complex than re- 
sponses to impulsive stimuli. A judgment of 
apparent magnitude of a simple stimulus is 
certainly a process of great complexity. The 
real issue is that there is, in fact, little physi- 
ological information and few proposed stim- 
ulus conditions which are relevant to the 
discussion of these “higher cognitive proc- 
esses.” 

In addition, we are also specifically exclud- 
ing from this discussion nonevoked electrical 
potentials of the nervous system, such as 
electroencephalographic waves. Electroenceph- 
alographic waves are susceptible to uncon- 
trolled variation, since we have not yet es- 
tablished all of the internal and external 
stimulus conditions which can modulate 
them. Vogel and Broverman (1964) reviewed 
the tenuous relationship which seems to hold 
between the electroencephalograph and the 
IQ, for example, without presenting a con- 
vincing argument that the electroencephalo- 
graph can be used as an indicator of intel- 
lectual competence. 

The questions to which we will attend are 
well defined. It is important to establish 
exactly what we mean by a compound evoked 
potential (CEP) and to discuss current the- 
ories of its genesis, In particular, the relation 
between the CEP and the activity of single 
neurons must be elucidated. The second major 
purpose of this paper will be to present re- 
cent research from our laboratory and to 
review other relevant research which bears 
upon the question: Do measures of the com- 
pound evoked potential contain the same 
information that is expressed in the psycho- 
physical judgments, and can we determine the 
nature of the code? An important limitation 
of this paper is that we shall be concerned 
mostly with electrophysiological data gath- 
ered from human subjects rather than from 
lower animals. 

No paper on human CEPs would be com- 
plete without an acknowledgement of the 
contribution of G. D. Dawson. Dawson 
(1947) not only suggested and developed 
the averaging techniques for extracting the 
evoked cortical potential through the intact 
skull of a human being, but also developed 
the percutaneous procedure for recording the 
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peripheral nerve compound action potential 
in man (Dawson & Scott, 1949). His Tied 
was extremely fruitful and can be sean 
with stimulating the widespread eat : 
and effort in evoked potential work on h- 
intact human being. The Dawson CEP aie 
niques are particularly useful in comparé jö- 
studies of psychophysical and neurophy T 
logical responses carried out on Bumar or 
ings because they involve no surgery 
anesthesia. It is this direct compari i ysi- 
tween electrophysiological and psychop aie 
cal data usually collected from the same aon 
vidual under identical, if not simultant ot 
conditions with which we shall be mos evel 
cerned. Indeed, because of Dawson $ pegin- 
opments we today have at least, ine psy 
nings of a physiological description : 
chophysical phenomena. ¢ the 1 
This paper supplements several © iculatly 
views found in Koch (1962), a coves 
the article by Rosner (1962) which 
much of the relevant material up t° 
The reader’s attention is also ae 
recent major publication concerning , ajot 
CEPs. The proceedings of the n onse I? 
symposium on the sensory evoked m 964) 
man has recently appeared (WhiPP ‘ailable 
and contains much of the current 
knowledge pertinent to this paPe™ 


OKED 
Genesis or THE CompounD EV 
POTENTIAL ain tw? 
We shall, in general, be intereste ji ghe 


f the pape “yoke! 
tt 
( of 


classes of CEPs in this part of t? i 
first is the action potential which ous 
from peripheral nerves by percutane per 
ulation and recording. An earliet pe 
tal, 1960a) describes our tr o 
Dawson’s techniques and the elec ratio” 


cuitry we developed for the 8°” tion Fin 
these responses, Peripheral nee vat Tw 
tentials are graded responses whle of E p? 


ty 


amplitude dependent upon a VA? oral! 
ulus intensity and timing con 
These peripheral potentials a? pov? ye 
strong signals and are far enoug an pe 
overall noise level so that they “ill? 
corded and measured in single ° 
sweeps, git? 

The second class of potentials ose p 
we shall be concerned includes 
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tials evoked presumably from the cerebral 
Cortex and recorded through the intact skull 
and scalp. These potentials, unlike the pe- 
‘pheral nerve action potentials, are less syn- 
chronized, more attenuated, and, therefore, 
Smaller in relation to the electrical and physi- 
logical noise in which they are imbedded. 

us, it is necessary to filter these signals 
Y Means of statistical techniques. The best 

Own of these statistical filters is, of course, 
E averaging of time-locked signals such that 
X nts randomly distributed following the 
mulus tend ‘toward zero, while events 
ich are related to the stimulus tend toward 
T average nonzero value. Both analog and 
'sital devices have been applied as averagers. 


Peripheral Nerve Action Potential 


One of the first issues with which we must 
Concerned is the relationship between elec- 
cal responses to natural stimuli and the 

noses generated by impulsive electrical 

station in peripheral nerves. Natural 
muli lead to relatively unsynchronized re- 

Snses in the family of neural fibers which 

je up the compound nerve. Each response 

a typical axonal all-or-none response and 
ran be recorded with either microelectrodes 
Mserted into an individual neuron or a gross 

ao ttode which records the activity of sev- 
tal cells on a single amplifier channel. Pulsed 
Sctrical stimuli, on the other hand, lead to a 

P which is made up of the responses of 
van nerve fibers all firing at the same in- 
ant. This latter type of response may be 
Staded in amplitude, varying as more or fewer 
Ibers contribute their signals to the response. 
he issue is: does the -5-millisecond-long 
ectrical stimulus lead to a response which is 
Coded in a different language and thus mis- 
cad the investigator about normal coding 
\nctions? 

Gasser and Erlanger (1937) originally de- 
ned many of the properties of bundles of 
vons, such as the relation of nerve size to 
duction velocity and temporal dispersion, 

tj, the basis of impulsive electrical stimula- 

tit and observation of the CEP. The ques- 

Vox Which is now raised is whether these 
of te potentials reflect the same statistics 

y s nerve fiber population that could be 

ed with natural stimuli. 


le 


379 


In a very important recent paper, Casby, 
Siminoff, and Houseknecht (1963) reported 
the development of a cross-correlation tech- 
nique which allowed them to repeat the earlier 
axonal studies but with natural stimuli. The 
exposed tibial nerve of a cat was laid over 
two electrodes, and a running cross correla- 
tion at various temporal delays was computed 
for the individual spike action potentials 
which were elicited by brushing, tickling, 
and/or pinching the skin. Although their in- 
genious technique itself is of considerable 
interest, for the purposes of this discussion 
the most important issue is that the results 
obtained from their study were essentially 
identical to those described by Erlanger and 
Gasser. This important finding means that the 
electrical pulse stimulus does not distort the 
code by the simultaneous firing of all of the 
axons. Rather, the synchronicity acts as a 
prestatistical processing which summarizes 
the coding scheme during, instead of follow- 
ing, the physiological process itself. Thus, the 
highly synchronized and graded compound 
action potentials recorded from the human 
peripheral nerves can be assumed to be an 
adequate indication of the normal coding 
function which would be observed if natural 
stimulus conditions were used. 

It is also of considerable interest to con- 
sider whether or not repetitive stimuli of a 
relatively high frequency can give rise to low 
frequency oscillations in the responses of 
bundles of fibers. Such a demonstration 
would be interesting not only because of its 
relation to the peripheral nerve action poten- 
tial, but also because it would suggest that 
we must consider that the summation of very 
brief spike action potentials from axons un- 
der certain conditions might also give rise to 
long-lasting waves even in central parts of the 
nervous system, 

We have obtained just such results in our 
laboratory. The study about to be described 
was carried out with the equipment and 
method described in an earlier report (Uttal, 
1960a) except that, instead of two or three 
pulses, the stimulus was a repetitive string of 
electrical pulses of equal interval and equal 
amplitude. We discovered that there were 
very long period oscillations in the amplitude 
of the compound action potential recorded 
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3 ing spikes) 
Fic. 1. A 200-millisecond-long sweep with stimuli (downward going a ind 
every 8 milliseconds showing the oscillation of the amplitude of the cont} 
potentials (upward going spikes) evoked from human peripheral nerves. 


from human peripheral nerves when the nerves 
were stimulated with such a repetitive string, 
The effects of two independent variables, 
stimulus intensity and interstimulus interval, 
were investigated. A sample 200-millisecond 
record is shown in Figure 1. The downward 
going spikes are stimulus artifacts caused by 
pre-amplifier overdriving and serve only to 
indicate the time the stimulus occurs, The 
upward going lines are the CEPs from the 
peripheral nerves. A single CEP from a single 
stimulus is shown in Figure 2 on a 10-milli- 
second full-scale sweep. Thus, Figure 2 rep- 
resents a time-expanded display of one of 
many sequential responses to the repetitive 


i 
i ‘ i : interval 
stimulation. The interpulse gi milli A 
sample record of Figure 1 was res with a 
The stimulus was 3.5 ampa att 
millisecond duration. It can e e ampli 
is a long-term oscillation in enfia: ned 
. $i em) 
the peripheral nerve action po en de m ters 
gree of this oscillation has i is para jo! 
to be a function of the stimu'y® © cilla! 
mentioned above. Specifically» nest per 
occurs maximally (i.e., the E nsities, 
a vet ir : 
trough ratio) at low de jncrea aati?! 
he 0 
he } 
nigh? 


though the response amplitu 
increasing stimulus intensity. i 
also appears to be affected bY 

interval of the stimuli, having @ 


Fic. 2 


from a human periphe: 


JA 10-millisecond-long Sweep showing a 


otential 


single compound evoked P 
ral nerve. 
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Fic. 
the 


Upper arm above the recording electrode.) 


wet? UEh ratio when the pulses are close to- 
os „cT, even though the wavelength of the 
Cillation does not appreciably change. 
is he most important fact for this discussion 
that the response amplitude of a bundle of 
Xons can under certain conditions be modu- 
€d to produce wave forms which are far 
pager lasting than the individual spike ac- 
S potentials of the neurons in the bundles. 
3" Is also interesting to note that the wave- 
ee of these responses appears to be of the 
der of 100 milliseconds, and that the wave- 
vngth appears to be independent of both 
Merpulse interval and stimulus amplitude. 
i The origin of the oscillations in the ampli- 
“de of the peripheral nerve action potential 
AS not yet been explained. Two principal 
aj >0theses remain once we exclude artifacts 
ty technique and accept these oscillations as 
Won Neurophysiological events. The first as- 
h Fig that the oscillation is due to the changes 
Ves © nerve membrane potentials which occur 
tion 8 tetanic stimulation. Thus, the oscilla- 
ould reflect the collective “hunting” by 
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3. The effect of ischemia produced by a 
peripheral compound evoked potential as a 
between the two conditions: cuff on lower arm 
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@ CUFF ON UPPER ARM 
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CHANGE IN LATENCY (MSEC) 
a 
7 


TIME (MINUTES) 


pressure cuff on the amplitude and latency of 
function of time. (Note the small differences 
below the recording electrode and cuff on the 


the involved nerve membrane potentials for 
new equilibrium levels. The oscillation would 
be related in this case to refractory periods 
and a nonmonotonic accumulation of after 
potentials during the repetitive train of stimu- 
lations. 

The second hypothesis assumes that the 
oscillation is not a function of the nerve mem- 
brane dynamics of the stimulated nerves, but 
rather is a result of reflexively generated 
centrifugal activity, Thus, in this case, efferent 
activity would be assumed to be interfering 
with the afferent activity represented by this 
compound action potential. 

The existence of such reflexive activity has 
been established. Toennies (1938) has de- 
scribed a dorsal root reflex of sensory fibers 
in which spike action potentials are reflected 
back along the same afferent fibers which 
conveyed the nerve impulse away from the 
point of stimulation. Fullerton and Gilliatt 
(1965) have more recently reported another 
axon reflex in human peripheral nerves which 
appears to be produced at branch points in 
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efferent fibers stimulated antidromically. The 
fact that the effect reported in this present 
experiment was more pronounced at the low 
stimulus intensities at which sensory fibers 
are more likely to be excited (Dawson, 1956) 
suggests that the dorsal root reflex is a more 
likely explanation, if a feedback explanation 
ultimately proves to be more appropriate. 
Although the origin of the oscillations of 
successive responses is not clear, we can be 
relatively specific about the nature of the 
individual CEP evoked from human periph- 
eral nerves. Changes in the amplitude of the 
peripheral CEP seem to be almost entirely a 
function of the number of single axon re- 
sponses which are summed to make up the 
integrated response. This holds true under 
even the most extreme environmental condi- 
tions as illustrated by a recent study from 
our laboratory (Uttal, 1964) in which a 
sphygmomanometer cuff was used to create 
an ischemic condition in the arm. The study 
used the same paradigm described above— 
electrical pulse stimuli were used to stimulate 
major trunks of peripheral nerves, and the 
CEP was recorded from a superficial point of 
the nerve trunk further up the arm. The 
pressure block was imposed in one experi- 
ment above the recording electrode and in 
another experiment below the recording elec- 
trode. In each case, the reduction in nerve 
response amplitude at the recording point 
was the same as a function of time (see Fig. 
3). In the case in which the cuff was below 
the recording electrode, the blood supply of 
the nerve was normal at the recording point. 
Since the amplitude of response of a nerve 
fiber should be normal in a region of a nerve 
which has normal metabolic 
if it has arrived through an 
this result suggests that the 
ripheral nerve CEP is due to the reduced 
number of individual neuronal responses, 
Thus the reduction in amplitude is due to 
heightened thresholds at the ischemic stimu- 


lating point rather than to reduced action 
potential amplitude at 


point. 
Because so much of the variance in CEP 
amplitude is accounted for by the number of 
responses, even in this extreme ischemic en- 
vironment, it is not inappropriate to conclude 


conditions, even 
abnormal region, 
reduction in pe- 


an ischemic recording 
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: CEP 
that the amplitude of a peripheral 4 
in a given human subject is a V 


ing neurons: 
measure of the number of firing 


on this sub deal of literature 
e! wi nl- 

j x i ts ith a 

n J from experim T ae 


cutane- 
sponses generated and recorded aia a 
ously, and data such as these val type 
understand the nature of this spec 
recording situation. 


n ial 
Compound Evoked Brain Potentta 


From the preceding discussion m out © 
that we have a fairly good theo The sit- 
genesis of the peripheral nerve “7 seuss the 
uation is not so clear when we ive stimu 
CEP which is evoked by impus ed 
tion of peripheral nerves and rec pject. 
the intact skull of the human sU Taing 

One of the major issues cone jasting 
CEPs is the origin of the long 
slowly changing recorded ano and 
Purpura comprehensively TENA otentia pid j 
of the genesis of cortical P odels W jals 
stated that the early axonal T poten pe 
were invoked to explain slow br aio “atel 
could no longer be accepted, 4 oct 
cause spike activity could no tentials: Tee 
experimentally with the slow p° peen h 
models, he further stated, have ~s griti 
by newer ideas in which lowe t esizê p” 
synaptic potentials are hyp? al 0 
basic subunits which are sum jals 
duce the slow electrocortical p° 

Amassian, Waller, and Maai 
also expressed this same PO!” 
recent symposium on sensory sons, 
in man. For a number of TA of 5 
the experimental disassociatio and 
slow activity mentioned abing tion f 
erations of the volume reco“ ae aC e Ca 
they also suggested that the SP to tE y 
tentials do not contribute direc z 
recorded from the surface ° 
also stated that the slow Po 
known to exist, such as the i 
after potentials, are added t08 sad” Oe 
the wave form. port isi i 

Much of their evidence a an? yo" 
viewpoint is in terms of P 
small sample of neuronal 
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Modern computing equipment allows much 
larger sample sizes to be collected, and, on 
the basis of these large samples, relations do 
emerge between spike activity and the slow 
potentials. 

Fox and O’Brien (1965) used a combina- 
tion of computer analysis techniques to dem- 
onstrate a very high correlation between the 
frequency histograms of latencies of a single 
cortical cell as recorded with an extracellular 
microelectrode and the averaged CEP as re- 
corded through the same electrode but with 
an amplifier passing lower frequency signals. 
The relationship between these two measures 
is rather astounding (see Fig. 4) and indicates 
that spike action potential may have been 
somewhat prematurely rejected as a major 
contributor to the slow potentials. Fox and 
O’Brien’s work suggests that the CEP is a 
measure of the probability of spike firing and 
demonstrates the missing relation between 
spike activity and the CEP. The work on the 
amplitude oscillation of peripheral nerve CEP 
described above also demonstrated a link 
between the slow response and the spike ac- 
tion potential, regardless of the origin of the 
oscillation. 

The axonal and the slow dendritic or syn- 
aptic explanations may, however, not be quite 
as separate as they seem. Synaptic and den- 
dritic potentials may be themselves very 
closely related to the spike activity, being 
precursors of spikes with the same encoded 
information. We may, therefore, find our- 
selves replicating the history of atomic phys- 
ics wherein two seemingly contradictory the- 
ories were vigorously debated only to be 
found, in later years, to be essentially different 
expressions of the same information. Thus 
the CEP might become a useful sign of ‘brain 
activity conveying a summated form of the 
information carried by both the spike and the 
graded activity of cortical neurons, From the 
point of view of the psychologist, the spe- 
cific origin of the potentials then would be- 
come a question of lesser importance, 

For the time being, let us consider only 
that the genesis of the cortical CEP is still 
being vigorously debated and that while some 
feel that it strongly relates to synaptic and 
dendritic graded activity, challenges have 

been inserted into the discussion which argue 
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strongly for direct spike-action-potential con- 
tributions to the cortical CEP. 

Another controversy concerning the gene- 
sis of the cortical CEP revolves around the 
question of whether these potentials are ois 
tical at all. Bickford, Jacobson, and Coy 
(1964) have proposed that many of * 5 
components of the evoked potential opel 
larly parts of the responses to acoustic een 
lation) are not neurogenic, but een 
activity of muscles time-locked to sec 
ulus. Using a deceptively simple expen? of 
they suggested that major component no 
the evoked cortical potential are indee 
cortical but arise from the neck muscles. 
the other hand, recent studies such as n 
one by Domino, Matsuoka, Waltz, ‘al 
Cooper (1964) demonstrate that poten 
recorded through the intact skull are a 
identical in shape to those recorde houg? 
the exposed surface of the brain, err 
greatly reduced in amplitude. On the ers 
this type of evidence, we now feel that 


i the 
i a 
ford’s warnings, though prudent aoe out 
time they were expressed, have bee? E 
indicated by newer direct evidence. the 


. 7 at 
Brazier (1964) has cautioned tne ding 


averaged evoked potential as @ e it is 
measure may be misleading, even wor le, 
demonstrably cortical in origin. FO" a sig 
the average of a very large amplitude exten! 
which is varying in latency tO 2” -could 
greater than the length of the respons m 
lead to an averaged response with zer° 
tude. . 
Another caution which has not ee, 
expressed involves the variability of t “0-00 
cal potential associated with = 
or session-to-session variations wile” of a 
serial measurement. The computation 
averaged evoked potential is 4 : eral Fe 
time-consuming process, and the ger ged | 
cedure has been to record the av the 


A Ke 
Sponse to, say, 80 presentations “pon ti 
stimulus, alter the stimulus, 27 stie 
the averaged response to another ingle nf 
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curring during the experimental session may 
actually introduce a variability into the serial 
records greater than that produced by the 
stimulus variations. Counterbalanced order of 
stimulus Presentation can, in part, solve this 
Problem, but many investigators probably ig- 
nore this issue of sequential effects. 

We have Carried out experiments in which 
m eect of simply allowing a subject to sit 

e experimental room and respond to a 
constant stimulus over a period of an hour 
Was measured. Figure 5 is a series of photo- 
8taphs of records from a subject in which 
each Photograph is the plot of 100 sequential 
averaged evoked responses. The decline in the 
amplitude of portions of the wave as a func- 
tion of time is evident, even though the stim- 
ulus conditions (.5-millisecond pulse dura- 
tion, 7-milliampere stimulus intensity) were 
constant, This result implies that it is neces- 
*ary to have an acompanying response which 
ìs developed during the same time period as 

€ responses to the test stimulus. This stand- 
ard response has been obtained in our labora- 
tory by averaging alternative records for two 
different stimulus conditions. 

A standard stimulus, constant throughout 
the entire experiment, is presented on odd- 
Numbered stimulus presentations while the test 
Stimulus condition, varying from one step of 
an experiment to the next, is presented on 
even-numbered stimulus presentations. The 
computer controlling the experiment averages 
the even-numbered and the odd-numbered re- 
Sponses separately. A comparison is then made 
between the two averaged records to deter- 
Mine the effects of the stimulus. 

The averaged evoked potentials are, there- 
fore, collected during an identical time pe- 
tiod for both the standard and the test stimu- 
lus. The price that one pays for this is to 
reduce even further the number of measure- 
Ments that one can make during a single ex- 
Perimental session. On the positive side, 
though, is the fact that measurements can be 
Made in terms of deviations from the stand- 


q response. 
Ie Such a technique should solve many prob- 
win but the psychophysiologist still is faced 
Ì ma the problem of deciding which of the 
| ay Parameters of the complex wave form 
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Fic. 5. Sequential averaged compound evoked 
cortical potential taken over a period of an hour 
with constant stimulus conditions to demonstrate 
the variability of the brain compound evoked po- 
tential without variation in the experimental stimuli. 


of the CEP he should measure. Certainly 
latency of the various components is one sig- 
nificant measure, but so are other measures, 
such as amplitude and area under the curve 
relative to some arbitrary zero potential. We 
shall not defend any specific measure, since 
convenience, relevance to the particular study, 
and simple intuition all contribute to the evo- 
lution of a satisfactory measure. And, indeed, 
the determination of which particular meas- 
ure one uses is the very essence of the in- 
quiry we are pursuing in this paper. We are 
examining these potentials to determine if 
they do, by some measure or another, relate 
closely to the psychological code. To select 
one or the other at the present time would 
only beg the question. 
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Over the last few years, a number of in- 
vestigators have explored the problems of the 
evoked potentials in search of functional re- 
lationships between stimuli and the variations 
of the CEP. We shall not be concerned with 
this physiological data, but in this short re- 
view we shall deal only with those experi- 
ments which directly compare some aspect of 
human thought or behavior with some ap- 
parently equivalent electrical potential. The 
study of the physiological parameters is, of 
course, basic to a full understanding of the 
latter class of experimental findings, and the 
reader is directed to the work of Dawson 
(1947) and Rosner and his colleagues (Alli- 
son, 1962; Goff, Rosner, & Allison, 1962), in 
particular, for the foundation necessary to a 
full understanding of the physiological or- 
ganization of these responses. 

In an earlier work (Uttal, 1960a) on the 
comparison of the peripheral nerve CEP and 
the psychophysical judgment of apparent mag- 
nitude, we concluded that the amplitude (i.e. 
count of the number of activated fibers) was 
the critical value in the encoding of subjec- 
tive intensity. However, to the best of our 
knowledge no study on the relation of the pe- 
ripheral CEP to the quality of sensation has 
yet appeared in the literature, and there is 
therefore, nothing to say about this type of 
encoding. 

A number of papers discuss the relation of 
the brain CEP to various aspects of psycho- 
logical performance, The final sections of this 
report will review the evidence fo: i 
between the brai i ation 

e brain CEP and the stimulus and 
pe psychological parameters. 
e association between certai 
pects of the brain CEP with pei ful. 
ness of the stimulus pattern has been in E 
gated by a number of recent experi oe 
Meaningful anges 

. ingfulness has several different defini 
tions in this group of experiment: et 

In the first (Haider, ea Li 
1964), meaningfulness was dekned ees 
of the discrimination of a visual sti ilus 
which was slightly less intense th aet 

i an the others 
which occurred before and after. The subject 
was instructed to respond to the randomly in- 
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terspersed less intense stimuli by pressing 4 
key. During the time period following the 
stimuli, signals from macroelectrodes on the 
head were recorded on an analog tape recorder 
for later analysis. The details of the experi- 
mental design are not germane to this discus: 
sion, but the general results were: (a) The 
amplitude of the evoked potential declined 
over the 1.5-hour period of the experiment; 
(b) There was a corresponding decline 
the percentage of correctly detected sig"? 
stimuli; (c) On the basis of @ separate ai 
eraging of those electrical responses tO ee 
which were missed and those which were 7 
rectly identified, it was found that respons 
to correctly identified stimuli were associat 
with CEPs larger than the CEPs with la 
responses to incorrectly identified stim! 
were associated. 

The authors of the report conclu 
CEP may serve as a relatively 5% 
measure of the state of vigilance of w 
ject. This experiment is not convincing, 
ever, for although there may indee ¢ the 
correlation between the amplitude ae 
evoked potential and the state of VIBE one 
it seems likely that the attenuation ™ 
hancement of the CEP may be 4° 
nomenon unassociated with the i show” 
activity they seek to measure. We a 
(see Fig. 5), for example, that que atent” 
tion in the amplitude of the evoke 
goes on under conditions in which mores 
lus does not change at all. Further ent fo 
would seem that Bickford’s are wo 
myographic contamination of the £ atio”, P 
be most applicable in just this situ yit? d 
attentiveness might well be associate 
generalized orientation reaction 
be more directly measured by my dy, yn? 
cordi he boi i 

ings from other parts of t vets? yee 
damaging to their argument, nower ont? of 
pupil dilation was not measure 
Pupil size, varying with accom; 7 
general alerting reaction, could Ea he ype 
CEP in the direction indicate“ |<, 
periment without a direct rela ance wn 
cortical processes involved in = etl gt 
an experimental situation iS "gue t ce 
susceptible to technical artifacts sit 
shifts in the location or contac f 
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the recording electrodes with scalp and head 
movements. 

Somewhat more convincing, though still 
Subject to the same sort of criticism, is a simi- 
lar study by Davis (1964). In this study the 
meaningfulness of an acoustic stimulus pat- 
tern was related to evoked potentials in a 
Vigilance task. The stimuli were tones which 
might or might not vary in intensity. It was 
the subject’s task to determine whether or not 
the third of four tone bursts was louder than 
the second. The first and fourth served as 

ready” and control signals, respectively. The 
Separation between each of the four tone 
bursts was 2.5 seconds. The results of the ex- 
Periment indicated that the averaged CEP re- 
Sponse to those stimuli in which a decision 
task was called for was larger than when no 
decision was called for. The subject had been 
instructed in certain of the trials not to make 
a discriminative judgment, but simply to push 
the response button each time he heard any- 
thing or, alternatively, not to attend at all to 
the experimental stimuli, but rather to read a 
Magazine during the experiment. 

A criticism of this experiment is, of course, 
that the time between stimuli was also long 
enough to allow the subject to orient in some 
Secondary fashion, that is, to develop a new 
“attentive state.” It may be that the measure 
of attentiveness or vigilance suggested by 
both of the two preceding reports is an in- 
direct effect generated either by myographic 
Contamination or by some indirect physio- 
logical attenuation of the signal. It would 
have been no more surprising if the CEP had 
actually been larger with lower levels of de- 
tection performance, just as the alpha rhythm 
mtalia troencephalograph increases when 

There is Er a et ean ne mane 

RA , a third study in which 
the association between meaningful and non- 
meaningful stimuli and the amplitude of the 
CEPs is more rigorously demonstrated, Chap- 
man (1964) described an experimental para- 
digm in which irrelevant influences were more 
carefully controlled. His subjects did mental 
arithmetic, adding two numbers which were 

Presented on a single visual display preceded 

by a noninformative warning light and with 

another noninformative field separating the 

‘wo numbers, All four of the visual stimuli 
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occurred within a second as the displays 
changed from one to the next in a quarter of 
a second. This rapid action reduced greatly 
the amount of time for changes in muscular 
set or other orienting reactions. In this ex- 
periment with appropriately counterbalanced 
practice order, the evoked responses to in- 
formational stimuli are significantly greater 
than the evoked responses associated with 
noninformational stimuli. 

This experiment adds credence to the argu- 
ment of the two weaker studies on the effect 
of meaning or attentiveness on the CEP from 
the brain. In this case, more adequate control 
had been effected which strengthened the 
argument that the enhancement of the evoked 
potentials for significant stimuli is a true ef- 
fect associated with the neural bases of the 
related intellectual processes, rather than an 
epiphenomenon produced by only incident- 
ally related preparatory responses. Unfortu- 
nately, too little is known of the latencies of 
preparatory responses to make this experi- 
ment crucial. 

Among the first investigators to make a di- 
rect comparison between psychophysical meas- 
ures and CEPs were Geisler, Frishkopf, and 
Rosenblith (1958). They found that the 
evoked response produced by an auditory 
click stimulus appeared in the averaged CEP 
at approximately the same stimulus intensity 
as that at which the subject first reported he 
was able to detect the stimulus. 

Rapin (1964) has also found a moderately 
high degree of relation between the audio- 
metric threshold and the threshold for the 
averaged evoked potentials. These are impor- 
tant results because they, along with the phe- 
nomenon of saturation demonstrated in som- 
esthesis (i.e, medium level stimuli produce 
maximal electrical potentials, Uttal & Cook, 
1964), suggest a wide degree of spatial diver- 
gence of the induced excitation as it proceeds 
up the ascending pathways. The temporal 
course of the CEP, of course, is also very 
much prolonged compared with the duration 
of the stimulus, and this suggests that a great 
degree of temporal dispersion also exists. 
Nevertheless, the Rapin and Geisler et al. 
studies are illustrative of a very important 
correlation between the CEP and the psy- 
chophysical judgment at the threshold level. 
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The saturation phenomenon 
above does not, however, correspond to psy- 
chophysical judgments of higher stimulus lev- 
els. Figure 6 shows the amplitude of the 
evoked potential for a typical subject as a 
function of stimulus intensity, and Figure 7 
shows the related monotonic increase in the 
subjective magnitude estimates (from Uttal & 
Cook, 1964). Since the CEP amplitude does 
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Fic. 6. The relation between stimulus intensity and amplitude of the brain 
compound evoked potential. (N 


ote that the amplitude reaches near maximum 
thus demonstrating the phenomenon of satura- 


ndë t 

Jitu! 0 

not change while the subjective a ee 
varying widely, simple amplitude oe of Ea 
not appear to be adequate MEE an pee” 
jective intensity. If there had a nave 
tematic change at all, we ma i i J0% 
close to defining some coding rela op ysl? 
the psychophysical and the elect" 
cal variables. 


im 
i er + 
This then brings us to anoth 
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ESTIMATES OF MAGNITUDE 


STIMULUS CURRENT IN MILLIAMPERES 


Fig. 7, The relation between psychophysical estimates of apparent magnitude 
(Note the “monotonic” increase of apparent sensory 


Point. Rosner (1962) has pointed out that 
the functional relationship between the psy- 
chophysical and electrophysiological varia- 
bles need not be a linear one. Certainly the 
Spatial representation of the body in the 
Somatosensory area of the brain is better de- 


scribed as a topologically constant relation 
than as a linear one. In this scheme, the 


of the body as 
measured externally, This one example shows 


neural 
Psychophysical code is a 
linear one and that any such assumption 
of linearity begs the question this field of 
research aims at answering. 

In a different sort of study, investigators 
searched for a CEP correlate of the masking 
Phenomena found in near simultaneous elec. 
trical stimulation of the finger tips. Matin 
(1959) has shown the temporal function for 
the masking, while our laboratory: (Uttal, 
1960p) has investigated the interaction as a 
unction of distance (number of fingers) 


and the stimulus intensity, 
magnitude as stimulus intensity increases.) 


between the test and masking stimuli, One 
of the major results of the latter study was 
that there was no masking for stimuli applied 
to different hands, but that extensive masking 
occurred between fingers on the same hand. 
An examination (Uttal, 1964) of the brain 
CEP recorded under analogous situations has 
shown this same effect to obtain at least 
qualitatively in that no interaction could be 
demonstrated between the stimuli applied to 
opposite sides of the body. This finding is 
contrary to the results for the cat in which 
there was strong inhibitory interaction be- 
tween stimuli applied to opposite paws 
(Nakahama, 1958). 

Barlow (1964), in a difficult and complex 
experiment, studied the relation of the time 
at which visual stimuli were perceived to the 
components of the evoked potential. The 
stimulus in this experiment was the sudden 
movement of a spot on the face of a cathode 
ray tube. His findings were that the percep- 
tion of the Movement occurred within about 
15 milliseconds of the movement, but that 
the first components of the CEP appeared 
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much later, 80 to 90 milliseconds after the 
stimulus. Apparently, he did not detect any 
specific primary early evoked responses, such 
as those which are found in the cortical so- 
matosensory CEP. The difficulties in experi- 
ments of this sort are formidable, and Barlow 
stressed that his conclusions were severely 
limited by the ambiguity in measuring the 
time of visual perception. Nevertheless, his 
conclusions indicate a lack of correlation be- 
tween the CEP and the time of perception. 

Halliday and Wakefield (1963) have 
studied evoked potentials to cutaneous elec- 
trical stimuli in patients with lesions in the 
ascending tract of the spinal cord or brain 
stem which resulted in a loss of position or 
cutaneous sensation. They reported that the 
cortical evoked potential was not affected in 
cases in which the cutaneous senses were 
absent, but was interfered with when the 
joint-position sense was absent. They tended 
therefore, to attribute the evoked potential 
in its entirety to the joint-position sense and 
further concluded that it is carried by the 
posterior column pathways. 

Donchin, Wicke, and Lindsley (1963) re- 
ported a study in which the visual brain CEP 
evoked by two sequential flashes, a weak 
stimulus followed after a variable ‘length of 
time by a bright masking stimulus, seemed to 
reflect an analogous subjective response. In the 
analogous psychophysical experiment, it was 
found that a second stimulus blanked out 
the perception of an earlier one, if the earlier 
one occurred within a certain range of del 
(0-25 milliseconds). The brain CEPs ro 
lowed a pattern which the investigati J 
was the same. Thi gators felt 

ey went on to conclud 

therefo e 
simple al that the CEPs appeared to be 
mple algebraic sums of the responses pro- 


duced individu 
ally by the t 
. . i i 
p st and masking 
Within limits, therefore, 
n the data S 
electrocortical activity "assotiated ee ee 


interactions to paired flash sti i 

1S r stimulations m; 

from additive brain processes, This a h 
port the assumptions of other workers wh Ne 
used paired somesthetic stimuli [p. 1286] ai 


However, this is exactl 
, this xactly what those 
have worked with paired somesthetic n 
have not found. Allison (1962), Shagass and 
Schwartz (1962), and Uttal and Cook 
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(1964) all found that the second of two 
evoked responses was diminished in size dur- 
ing the 100 milliseconds following the initial 
stimulus, and that these temporal inhibitions 
led to vast deviations from a simple additive 
process. The difference between this group 
of somesthetic experiments and the results a 
Donchin, Wicke, and Lindsley may lie tpe 
fact that the stimuli used in the latter experi 
ment were so vastly different in amplituc e 


lamberts while the intensity of the masking 
flash was 260 millilamberts. Even if one = 


siders the saturation phenomenon, pie aa 
intensity of the blanking flash woul 5 il 


swamp out the effect of the first 100 
seconds. 


CONCLUSION 


The inconclusive findings 
experiments in the literature which 
direct comparisons of psychological er 
trophysiological responses in man restrict le 
answer to the question asked in t 
of this paper. Certainly, there has tO 
relation between nervous activity ae 
ior, but equally as certainly we bave ” gnould 
established the specific measures W° “once: 
make to “decode” the sensory expe! cial 
Perhaps there could not be a sing® 
experiment that would answer 
for us completely, because different $ 
mechanisms may be encoded i? 
languages. 

This brief survey does indicate, th 
that an increasing amount of ettor rii 
expended in investigating this probler tial 
that many investigators have made “ 
assumption that there does appe" 
decodable relationship. In peripheral 7 co? 
particularly (this means that we mus gely 
sider the somatosensory syste™ i 


; c 
since th ye AC ply 
e other receptor ner snacce™® io 

a 


Quality coding, on the other ie e of g 
pecans’ mod p 
F u: 
vagueness of defining the cutane™ sing K 


natural stimuli. 
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rela Seat unanswered issue lies in the 
~ of the CEP recorded from the 
OG Simi to psychological experience. It 
general e presumptuous to imply that any 
uesti answer can yet be made to this 
statistical The issue is whether or not this 
Sociation function—the CEP—has any as- 
ittelevant with mental activity or is some 
conclusi, epiphenomenon. The data are in- 
ÎS on ve. Several studies suggest that there 
association, while others seem to show 
inpia on, when simple measures of 
tunatel e and latency are tabulated. Unfor- 
explore. » aS a group, the experiments which 
stimul the relation of what we have called 
controle meaningfulness exhibit insufficient 
ambi S, weak stimulus situations, and 
Suous results. 
e uch additional research is needed. It is, 
sle ever, important to remember that a new 
ment has been inserted into this endeavor. 
h 18, now possible to carry out electro- 
: Ysiological observations on the intact and 
Ormal human being while simultaneously 
faking psychophysical inquiries. Although 
E suggestion of continued research may 
fem trite, Dawson’s creative contributions 
Now allow us to study both aspects of this 
Question in its most appropriate experimental 
animal, man. For psychologists, in particular, 
this is a crucial development. 
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CONVERGING OPERATIONS FOR PERCEPTUAL DEFENSE 


THOMAS NATSOULAS : 


University of California, Davis 


A 


discussion is presented of some recent experimental attempts to provide 


converging operations for a concept of perceptual defense. For present purposes, 


perceptual defense is defined 


as a relative failure of perception per se due to 


the emotional character of the stimulus. Experiments are evaluated and dis- 


cussed mainly 


in terms of their ability to eliminate as an explanation the 


response-bias hypothesis for differential accuracy between neutral and emo- 


tional words. In addition, following Blum (1955), 
; all the experiments discussed require, if differences in 
attributed to perceptual variation, converging 


is described and applied 
accuracy of recognition are to be 


a stimulus-effect hypothesis 


operations to eliminate this hypothesis. It is argued as well that the search for 


converging operations for 
in other areas of perception. 


“The term ‘perceptual defense’ is mislead- 
8 since there is no experiment known to 
this reviewer that contains the necessary 
Converging operations that would permit a 
Conclusion of defense in perception, per se,” 
Writes Eriksen (1957, p. 202). He is referring 
to the conceptual distinction between per- 
Ceptual and response systems emphatically 
drawn by Garner, Hake, and Eriksen (1956), 
and to their general proposals for an experi- 
mental methodology to distinguish perceptual 
effects from response and other effects, In a 
recent article, Bootzin and Natsoulas (1965) 
Claimed that the research area of perceptual 
defense has been exceptional in its concern 
with this distinction: “In fact the history of 
the area might be well characterized as a 
Collective search for experimental procedures 
and designs to reveal the workings of percep- 
tion unconfounded by other variables [p. 
461].” The present article discusses some of 
this history in the hope that awareness of the 
problems involved and analysis of some re- 
cent attempts to deal with them will make 
a solution more likely. Such a solution will 
be of general importance to the study of per- 
Ception, There is reason to believe that the 
invention of converging operations which, 
Riven positive results, would permit the con- 
Clusion that perceptual defense has occurred 
Will provide a methodology for the study of 
Numerous perceptual phenomena, a method- 


in 


of The author wishes to thank Theodore E. Parks 
riti g University of California, Davis, for his helpful 
ical discussion of this article. 
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perceptual defense has implications for methodology 


ology permitting unambiguous conclusions 
about perception, per se. 

The following discussion may be charac- 
terized as well in a negative way, as eschew- 
ing the strategy of reducing the substantive 
question of perceptual defense to an instance 
of the operation of more general processes, 
for example, perceptual readiness or set 
(Bruner, 1957), and then either (a) address- 
ing the methodological problems involved in 
demonstrating whether and how set affects 
perception as opposed to other processes,” or 
(b) deriving and testing predictions from a 
theory of set as it affects perception. This 
path is avoided. The present discussion is 
neutral with respect to kow the emotional 
character of the stimulus can affect percep- 
tion in order to concentrate on how it can be 
shown whether the emotional character of the 
stimulus affects perception. In other words, 
for purposes of this review, the question of 
explanation or mechanism is assumed to be 
subsequent to the development of an ade- 
quate methodology and the collection of 
unambiguous results about the perceptual 
process itself. 

? The attempts to demonstrate the influence of 
explicit set on perception (as distinct from other 
Processes) have themselves an instructive and as yet 
inconclusive methodological history. Some important 
recent chapters are Lawrence and Coles (1954), 
Neisser (1954), Long, Reid, and Henneman (1960), 
and Swets and Sewall (1961). Ralph N. Haber of 
the University of Rochester is preparing a 
retical analysis and review of this literature. 


theo- 
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Of the several attempts to provide the nec- 
essary converging operations for perceptual 
defense that will be discussed here, only the 
first had been reported in the psychological 
literature at the time of Eriksen’s (1957) 
summary statement. Blum (1955) presented 
quadrads of Blacky pictures for a fixed dura- 
tion to subjects (Ss) who had to say which 
picture was in each of the four positions. 
All Ss knew the 11 Blacky pictures very 
well, but none knew that the same 4 were 
being presented on every one of the 48 trials, 
rotated in positions from trial to trial. After 
the experiment, all Ss stated that they were 
unable to recognize any of the Pictures; in 
effect the experimental session was for them 
a guessing game. 

Independently of the tachistoscopic portion 
of the experiment, it was determined for 
every S whether he had conflict and showed 
repression in connection with each of the 11 
pictures. Of the total of 17 Ss, 6 (5) showed 
both conflict and repression for 1 or more 
of the 4 (7) pictures (not) Presented, 14 (14) 
Ss showed no conflict or repression for 1 or 
more of the pictures (not) presented. These 
are the four categories or conditions that 
Blum (1955) uses to argue for a perceptual 
effect; in the case of the Presented pictures 
his prediction is that there should occur 
fewer calls of emotional than neutral pictures 
and no difference should appear between emo- 
tional and neutral absent pictures. The as- 
sumptions are that, “(a) the subliminal stimu- 
lus would activate the conflict; and (b) the 
necessity to verbalize the conflict area at 
the conscious level (by calling it) would elicit 
defensive behavior [p. 26].” The results were 
as expected: the mean number of calls per 
picture were 9.42, 17.12, 15.20, and 16.69 for 
Presented pictures with and without conflict 
and repression, and absent pictures with and 
without conflict and repression, respective] 

, y. 

Although the purpose of Blum’s experi- 
ment was to demonstrat 


e the need for a 
concept of perceptual defense, his predictions 
follow from an assumption that defense oc- 


curs at the response level (with “the neces- 
sity to verbalize the conflict area”). It is 
fairly certain that what Blum means by per- 
ceptual defense is defense instigated by a 


stimulus but occurring in the response system 
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(hereafter referred to as the “stimulusett 
hypothesis”), Consistently with this, sub- 
Blum appears to have shown is that & den- 
liminal stimulus can affect response ne t 
cies. He proposes that this occurs as sich in 
of the stimulus activating a conflict wa! view 
turn affects response probabilities. ao sti 
of perceptual defense needs to be jksen’s 
guished from the one implied by eaten? 
(1957) summary statement about for pur 
in perception, per se” and adopted defense 
poses of this discussion. Perceptual of the 
here represents relatively “a failure Erik- 
perceptual system” (Garner, Hake, ted, the 
sen, 1956, p. 153). Intuitively sta seen 
tachistoscopic stimulus word is not it is 4 
clearly, in as great detail, etc., when 
source of conflict or anxiety. 
Blum (1955) then seems not to rging 

vided the necessary operations convê oweve" 
the concept of perceptual defense. forward 
he seems to have moved the searc 


have Pro 


ender 
by his attempt to evaluate respons? tures 
cies. Thus the purpose of the absen eing 


(while S believed these pictures We as 1° 
Presented and responded as SUC% for 
Provide a base line of comparison; on the 
effects of true or appropriate n epi 
Corresponding responses. Similarly, Ti ies 
differences will have to be A enci 
under conditions in which response e of i 
on their own cannot be the sour 
relevant differences in results. ) repot toy 
Mathews and Wertheimer (1958 
a experiment similar to on 
ure’ Measure of Perceptua jon- yas 
contaminated by Response Suppress pt "i 
this experiment, an indirect a rep” 
made to secure a score for each emo” et 
ing his tendency not to call = g 
words presented to him. The $S 
a list of eight words and told 
word flashed on the screen (sing then” por? 
brief duration) would be one of he # oi 
of the words were neutral (bY of 
judgment) and four were selectl sg Pho 
tional for each § on the basis 


word-association test. The neutr engt o 


S oO 

tional words were matched for ient ogot 
proximate structure,” and freg” ral th 
currence (Thorndike-Lorge dos” 


Count), As Blum (1955) P3 
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experimenter (E) flashed only some of the 
expected materials, four of the eight words, 
two emotional and their matched neutrals, 
Providing again four categories: words pre- 
Sented, neutral and emotional; words absent, 
neutral and emotional. 

For every S two z scores were calculated. 

the assumption of no difference between 
“motional and neutral words in response 
Probabilities, one would expect for the absent 
tds a number of calls of emotional words 
equal to Np where N is the total number of 
alls of the absent words and by the 
sumption is 5, An S’s z score for response 

PPression would be the actual number of 
ansent, emotional words called minus the 
on number (assuming no response 

Ppression ) in o units. The same scoring 
Procedure was used as well for the presented 

Ords; the z score in this case is believed 
w e a measure of both response and per- 
wl Suppression. Thus a subtraction was 

Wired to yield a “pure” measure of 
“tceptual defense, 

athews and Wertheimer’s positive find- 

88 for “perceptual defense” can be inter- 
Preted in the same way Blum interprets his 
wn experiment: there is a specific stimulus 
Sect on response tendencies, reducing the 
falls of the corresponding emotional words. 

© assignment of the difference between the 

Scores for present and absent emotional 

ords to Perceptual defense in the sense of 
poten, perception (Mathews and Wertheimer 
ag to “pure” and “true” perceptual de- 
nse) is faced with Blum’s competing 
Stimulus-effect hypothesis, Converging opera- 
lons are needed to eliminate it. 

Neither Blum nor Mathews and Wertheimer 
Save adequate attention to the accuracy of 
their Ss’ performance, Differential accuracy 
between experimental conditions or kinds of 
‘timuli is a necessary result of the operation 

Perceptual defense as here understood; if 
Mehow avoidance or defense is occurring 
i in his perceptual system, S would not 
nes emotional stimuli as well as neutral ones, 
th, therefore make more identification errors, 
pp Ere demonstration of fewer calls of 
hy tional stimuli seems always to leave the 
OF A esis open: Ss are perceiving both kinds 

muli equally well, but responding to 
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them differently. The next experiment dis- 
cussed uses both a measure of differential 
accuracy between anxiety-arousing and neu- 
tral words and a response-bias score for the 
same words. 

Goldstein (1962) attempted (a) to demon- 
strate that a response bias against calling 
anxiety-arousing words can give the appear- 
ance of a perceptual-defense effect, that is, 
fewer correct identifications of anxiety- 
arousing words than of neutral control words, 
and (b) to determine whether, over and 
above the effect of response bias on differ- 
ential accuracy, there is an effect attributable 
to perceptual variation. The first purpose was 
realized through the inclusion of a “stimulus- 
absent” experimental group similar to that 
used by Goldiamond and Hawkins (1958). 
On every trial, the members of this group 
were stimulated merely by typewritten hash- 
marks while being led to believe that words 
were being presented for subliminal dura- 
tions. These Ss, as well as those presented 
with true stimuli (the stimulus-present 
group), had to choose a response from an 
available list of eight words, four neutral and 
four anxiety-arousing (as independently de- 
termined for each S). The stimulus-present 
group was exposed on every trial to one 
of the eight words, and the words were pre- 
sented equally often. 

The magnitude of each S’s response bias 
was found by substracting his number of calls 
of neutral words from half (64) of his total 
number of responses. For the stimulus- 
absent group the mean response bias was 
—4.55, for the stimulus-present group —4.90, 
both Statistically significant biases against 
anxiety-arousing words, Furthermore, in the 
case of the stimulus-absent group, it was 
shown that this response bias produced a 
mean “pseudoaccuracy” score significantly 
favoring the neutral words by 2.45 more 
words correct. Pseudoaccuracy scores were 
obtained by scoring, as correct or not, the 
responses of the stimulus-absent group in 
terms of correspondences with the stimulus- 
present group. This is clear-cut evidence 
that a response bias against calling anxiety- 
arousing words can produce what might have 
been accepted once as a perceptual-defense 
effect. 
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The stimulus-present group’s mean differ- 
ential accuracy score also favored the neutral 
words, by 4.50 more words correct than 
anxiety-arousing words. This was not sig- 
nificantly greater than the differential pseudo- 
accuracy, indicating that presentation of true 
stimuli did not increase the amount of de- 
fense shown, Goldstein concluded that the 
results support a reponse probability theory 
of perceptual defense; they do not display 
significant variation attributable to per- 
ceptual differences between anxiety-arousing 
and neutral words. 

Goldiamond (1962) argues more gen- 
erally against ascribing differences in perform- 
ance between stimulus-absent and stimulus- 
present conditions to perception. He points 
out that such ascribing is based on the often 
unlikely assumption that S’s response biases 
are the same although different stimuli are 
presented. When two conditions do differ in 
response bias, differences between them in 
performance, that is, in the differential accu- 
racy of identifying neutral and anxiety- 
arousing words, cannot be ascribed readily to 
a perceptual process. In Goldstein’s experi- 
ment, the mean response biases of the two 
experimental groups do not appear to differ 
significantly; if differences in performance 
had been found between them, these differ- 
ences presumably could have been attributed 
to perception, so far as the present argument 
has gone. However, the matter of response 
bias is more complicated. Goldstein, Himmel- 
farb, and Feder (1962) claim that the pat- 
tern of response bias may not be identical 
between conditions even when the tendency 
to choose neutral over emotional words is 
found to be equal: “Thus not only is it 
Possible for the subject to show a response 
bias between classes of stimulus words, but 
it is possible for him to show a within-set 
bias resulting from the manner in which he 
distributes his guesses within a set [p. 60].” 

Suppose S has a tendency not to use one 
of a set of anxiety-arousing words, a negative 
tendency which is greater than that 


c for the 
anxiety-arousing set as a whole. Under 
stimulus-absent conditions, pseudoaccuracy 


would be unaffected by this so long as the 
bias against the total set of anxiety words 
remained the same. On the other hand, under 
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> n of 
stimulus-present conditions, 


ors erform- 
such a bias would limit the level 2 $ 
ance on that particular anxiety WO a eat 
perceiving a fragment of the wae 
narily sufficient (e.g., in the case Z would bë 
tral words) to identify the wore, 


words,” that is, 
accuracy correction, would SA percep 
support . . . the hypothesis © in t 
blocking mechanism |p. 60]. wi imen 
text of Goldstein’s (1962) ha ri a 
necessary, therefore, to demone ies is si 
distribution of response probabi ousing se! 
lar within neutral and anxiety-a" 5 
of alternative responses. , à Natsoul 
The experiment of Bootzin wrespans? T 
(1965) makes irrelevant the " utra we 
between anxiety-arousing and S others aa 
by always pitting against eo Secon e 
response, words of the same ‘bution nd 
it demonstrates that the dist"! neutra ‘Ny 
sponse probabilities within woe gia 
anxiety-arousing sets is the ie 4. 
with response bias eliminated a$ accurae a 
variation, it shows that the rds 1 mh 
identifying anxiety-arousing W° words: 
great as the accuracy for neutra! «erce pe 
difference is attributed tO ? nds tha 
blocking mechanism” on the q yp? 
competing, response-probability 
has been eliminated. startle 
Two quadrads: each containing © (4s 
and two anxiety-arousing WO" 
pendently determined for a 1 
within a quadrad matched "Ge 
letter, and Thorndike-Lorge : 
Count; were selected for each hM g 
words and a card with five e in a "eoo 
exposed tachistoscopically sim8 fo Oo! ad fh 
order; the hashmarks always time erste 
(32 times), the words four for 037 pak 
01 second and four times each 
Following each exposure, 
of a booklet and checked << ha 
words on a sheet; howeve": é 
information prior to a stimu 
as to which words these wou! 
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` Was always between two neutral words or two 


anxiety-arousing words. 

p erences associated with personality 
lowing om Plicate the results, and the fol- 
Vate F are the overall findings. No differences 
here ound except at the .03-second duration 
lm for both neutral and anxiety- 
and on words was well above chance levels 
iia ere was significantly less accuracy for 
hae ore than for neutral words. The 
portion response bias taken was the pro- 
Within be choices of the favored alternative 
tral wo pairs of anxiety-arousing or neu- 
asked rds. In other words, the question was 
ee as to whether Ss tended to choose as 
tions ee one of a pair of anxiety-arousing 
he ~ mie often than one of a pair of neu- 
the a or vice versa. It was found that 
: erence in accuracy between recogni- 


lo; , K 
S of anxiety-arousing words and neutral 
ie Was accompanied by no significant 


“tence in response bias as thus defined. 
°Otzin and Natsoulas argue that this ex- 
Spe Provides evidence for perceptual de- 
cing uncontaminated by response bias. They 
Out that the difference in accuracy (at 
“Second duration) found between anxiety- 
using and neutral words cannot be at- 
uted to differences either in response bias 
. '0 differences in the magnitude of the 
ulus-word fragment affecting S. This 
Ves, according to them, perceptual varia- 
dit, as the likely source of the observed 
erence in accuracy. 
pas difference in accuracy between the two 
nn of words, although uncontaminated by 
He a ag and within-set response biases, 
ma d be attributed to postperceptual and 
‘ eresponse events which are affected by the 
motional character of the stimulus. If the 


titference in accuracy is due to these post- 
‘ceptual events, then S must be seeing the 
‘hxiety-arousing words as well as he sees the 
pUtral words, but “losing” more of the in- 
mation received in the brief time prior to 
SDonse, This amounts to an immediate 
h Mory or categorization hypothesis m 
ave 'f that aspect of perception 2 E neu 
ng dance takes place is to be isolated, ap- 
hag iate converging operations. Perteno i 
Wte, tO an aspect of perception because the 
j ton of what constitutes perception as 
| 


er 
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opposed to categorization or even immediate 
memory is problematical. Some students of 
perception have defined perception as a kind 
of categorization, others have pointed out 
that the central process does not terminate 
with the stimulus. Swets, Tanner, and 
Birdsall (1961) have proposed that percep- 
tion is a choice among Gaussian variables, 
albeit a greater number than in the case 
of signal detection. The decision processes 
involved in making the choice are included 
in their definition of perceptual processes to 
distinguish them from sensory processes “on 
the grounds that the former must be ac- 
counted for in terms of events presumed to 
occur at higher centers whereas the latter can 
be accounted for in terms of events occurring 
within the receptor systems [p. 338].” Two 
basic intervening variables, on which depend 
the responses of the observer, are postulated 
by signal detection theory: sensitivity (d’) 
and the criterion used in making a judgment. 
As Atkinson (1963) points out, signal detec- 
tion theory assumes that the sensitivity of 
the observer is fixed or varies randomly over 
time while the criterion varies in a systematic 
way depending on a number of variables af- 
fecting set or motivation. In its place, Atkin- 
son (1963) proposes a variable sensitivity 
theory of signal detection which allows 
for systematic changes in sensitivity from 
trial to trial more nearly consistent with a 
conventional definition of perception, changes 
possibly due to “orienting responses, periph- 
eral changes within the sensory system, or 
events presumed to occur at higher centers.” 
Thus, Swets et al. (1961) include a great 
deal in their concept of perception—all of 
the postperceptual and preresponse decision 
processes—while Atkinson (1963) is con- 
cerned with an “activation process” distinct 
from the “decision process,” a concept of 
sensitivity which may not be determined 
exclusively by sensory factors. 

While Bootzin and Natsoulas (1965) find 
it necessary to demonstrate the absence in 
their experiment of differential within-set 
biases between neutral and emotional word 
pairs, Goldstein, Himmelfarb, and Feder 
(1962) seek to eliminate the relevance of all 
response biases to differential accuracy scores. 
Turning from Goldstein’s (1962) statistical- 
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correction method, they adopt a spatial 
forced-choice procedure. This procedure, = 
sembling the forced-choice methods ig 
by Blackwell (1953), does not — r 
subject to verbalize or otherwise ge 
what word has been presented. Insteal , the 
subject is asked to locate spatially a = 
ticular word when it is presented along wit! 
one or more others. Goldstein, Himmelfarb, 
and Feder (1962) presented on every trial 
(at a fixed brief duration) a pair of words, 
one an anxiety-arousing word, the other its 
matched, neutral control. The task for S was 
to call out the position of one of the words 
as instructed by E, while knowing, too, what 
the other word would be. The kind of re- 
sponse bias which could be present in this 
situation would involve a tendency to say 
“right” (or “left”) when a particular kind 
of word had to be located. Such a bias was 
found: most Ss under both stimulus-present 
and stimulus-absent conditions tended to say 
“left?” when the location of an anxiety- 
arousing word was asked for and “right” for 
a neutral word. 


According to Goldstein, Himmelfarb, and 
Feder (1962): 


Although the forced choice spatial location method 
is not free of its own peculiar type of response 
bias, the counterbalancing of position of stimulus 


and type of stimulus word asked for by E prevents 
the response bias of word by position from affecting 
the total accuracy score [p. 60]. 


What they are referring to is the fact that 
if anxiety words are “recognized” better on 
the left because of a tendency to say “eft,” 
they will be more poorly recognized on the 
right to the same extent as there is an ad- 
vantage for them on the left as a result of 
the same bias. 

As proposed in discussing Goldstein’s 
(1962) experiment, this general line of argu- 
ment is valid for chance congruences, but 
not applicable where S’s responses begin to 
be affected by the stimulus. Consider a 
particular anxiety-arousing word and its 
matched, neutral control on two trials in both 
of which the anxiety-arousing word is on the 
right and the neutral one on the left. On 
the one trial the anxiety word must be lo- 
cated, and on the other, the neutral word. 
Assume further (@) that on the two trials 


f 
the words which must be ioçatel Pe Tiig 
ceived equally well, and (b) the mer the 
say “left” to the instruction to than the 
anxiety-arousing word is ogg neutral 
tendency to say “right” when F he rela- 
word’s position is required. eT ait anxiety- 
tively strong bias in the case a to respond 
arousing word, S is more likely utral word, 
erroneously to it than to the ne ut avail- 
without utilizing efficiently the ie 
able to him. This nonpercepie 
for the neutral word would not 
sated for on another trial (when ‘ert is 4. 
word has to be located) where ntly saying 
chance congruence due to mene cneeded 
“left.” These chance congruences 4 A chanci 
to compensate for the relatively riety T 
congruences occurring when the ied. hus a 
is on the right and must be losi rocedure r 
the context of this forced-choice iis that A 
well, it is necessary to demon an p 
responding with respect to neu response b 
tional words there is no greater in 
in the one case than in the oe aldste ; 
No difference was iauna a in the ty” 
Himmelfarb, and Feder ua d an to 
curacies of locating A appe aty 
arousing words. In addition, EY" 4g an® ge 
be no difference between ie ma 
arousing words with respect a es. Pog 0 
of their respective position | P me? “yor | 
interpretation of the expet!M® | two 
obstacle: Ss’ knowledge of W ation or 
s its lo tO aot 
are flashed on a trial perm! "par ay Pol 
as a result of perceiving one effe k Us 
Thus the perceptual defens? © cog! Jo 
° y tial its geh 
manifest itself because Pat ir pe” ar f 
the neutral member of a P# mem? ctw 
tion of the anxiety-arousing A We 


ye 
ell. ti" 45 C 

though it is not seen ex alten got w 

which would eliminate 2°." qu nd Pol 


3 tin. a i 
pothesis would require s ords, of 
matched anxiety and neutr rds % 

3 s iety WO ect ve Vb 
senting a pair of anxiety 1, A corte PP of 
neutral words on each tria ‘would $ 
by process of elimination “ erceP | 

: the P LO ath 
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word of the same type- ed 2 ve “ul 
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procedure like the one eet oe i 

of 16 five-letter words Tape” 

by E to be emotionally " 
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ing and 8 emotionally neutral. From these 16 
words, 4 sets of 4 words, all words in a set 
emotionally unpleasant or all neutral, were 
selected and projected simultaneously at eight 
durations which ranged from chance level to a 
high level of accuracy in the performance they 
yielded. On each trial S was asked to locate 
à particular word. Each word had to be lo- 
cated once at each duration. The results 
showed no difference in accuracy of locating 
neutral and unpleasant words. Relating per- 
ormance to ranking by S of the words in 
terms of pleasantness—unpleasantness did not 
Yield a significant effect. As in the case of 
the procedures of Goldstein (1962) and 
Goldstein, Himmelfarb, and Feder (1962), 
interpretation of such results as indicative of 
Perceptual defense (had differences been 
ound) would depend on a demonstration of 
€qual response biases for emotional and neu- 
tral words, 

Goldstein, Himmelfarb, and Feder (1962), 
n discussing their own experiment, put for- 
Ward a criticism applicable to MacIntosh 
(1961); “Possibly we have undermined the 
motivation for perceptual defense by insist- 
g that the subject first Zear the word from 

e experimenter before he sees it himself 
Ip. 61].” Bootzin and Natsoulas (1965) ar- 
Sue similarly that with repeated exposures 
Within a brief period, the anxiety reaction to 
a word may habituate. The requirement of 
the spatial forced-choice procedure that § 

ear an emotional word and keep it in mind 
May hasten this eventuality. Bootzin and 
Natsoulas show that their Ss’ performance 
on the anxiety-arousing words improved from 
eee to the second half of the tachisto- 

ries to a point where, i 
half, the difference in accuracy e 
tional and neutral words was no longer sta- 
tistically significant. The neutral words, on 
the other hand, did not show a significant 
increase in accuracy between the two halves 
Of the series of tachistoscopic exposures, 

Another experimenter who has attempted 
to demonstrate a perceptual effect is Zajonc 
(1962), whose procedure, considerably dif- 

‘tent from those already reviewed, involves 
he consecutive phases: (a) determination 

visual duration thresholds for recognition 
6 taboo words and 6 neutral words by the 
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ascending method of limits, with GSR read- 
ings taken on every presentation of a stimu- 
lus word; (b) a paired-associates learning 
task in which these 12 words were the stimuli 
and an additional 12 (6 taboo and 6 neutral) 
served as responses; (c) a second determina- 
tion of thresholds and GSRs for the stimulus 
words in which half the Ss had to respond 
with the response words learned in the paired 
associate task and half had to respond in the 
usual manner, by calling the stimulus word. 

The point of the experiment was to dis- 
tinguish the contribution to threshold meas- 
ures and GSR measures of the response made 
as opposed to the stimulus presented. The 
first part of the experiment (phase @ above) 
would not discriminate between the stimulus 
and response effects because a taboo word 
had to be called when recognized. Zajonc 
found that the taboo words provided higher 
recognition thresholds and greater GSRs. 
Garner, Hake, and Eriksen (1956) propose 
two hypotheses to deal with such results: 
either “the perceptual system differentially 
discriminates on the basis of the emotional 
content of the perceived stimuli” or “the 
difference in threshold is a result of a char- 
acteristic of the response system, which in- 
hibits the verbalizing of some of the words 
[p. 151].” They then suggest that the next 
steps taken by Zajonc in his procedure would 
be the necessary converging operations to 
eliminate one or the other of the two hy- 
potheses. 

The third phase of the procedure, by re- 
quiring Ss to call at times words from the 
neutral category to indicate perception of ta- 
boo words and of other neutral words, and to 
call taboo words to indicate perception of 
neutral words and of other taboo words (as- 
sociates learned in phase 5), permits evalua- 
tion of the contribution of the stimulus and 
of the response to the measured thresholds. 
For the group responding with the learned 
associates, taboo and neutral stimuli did not 
differ significantly in recognition thresholds 
although responding with taboo words yielded 
higher recognition thresholds than responding 
with neutral words. The other experimental 
group provided recognition-threshold results 
similar to those in phase @. Some effect of the 
stimulus was shown on GSR, but the response 
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was the primary determinant. From these 
results Zajonc (1962) concludes, “The evi- 
dence presented failed to disclose perceptual 
effects of any significance. The recognition 
threshold was found to be a function not of 
what S saw but what he had to say [p. 213].” 

If the Ss had not seen the taboo words as 
well as they saw the neutral ones, there 
would certainly have been an observed stim- 
ulus effect. To evaluate the design with re- 
gard to its ability to demonstrate perceptual 
defense, it is necessary to ask whether the 
occurrence of an observed stimulus effect 
would have provided unambiguous evidence 
for a perceptual effect (in the sense indicated 
earlier of a relative failure in perception). 
Blum’s (1955) — stimulus-effect hypothesis 
would seem to be a competing hypothesis 
requiring elimination by converging opera- 
tions; in Zajonc’s experiment an observed 
stimulus effect could be interpreted in terms 
of this hypothesis by stating that taboo words 
produce conflict which inhibits the appropriate 
responses from occurring. The fact that the 
response to a taboo stimulus was a neutral 
word would not eliminate Blum’s hypothesis, 
because the inhibition may be extended to 
responses other than the reading response 
associated with the stimulus. 

Zajonc (1962) views “defensive perceptual 
blocking” as equivalent to the “contribution 
of the stimulus [p. 206],” but Garner, Hake 
and Eriksen’s (1956) reference to the per- 
ceptual system’s differentially discriminating 
suggests that they are adopting a definition of 
perceptual defense as occurring within the 
perceptual system. The next step in the 
search for the necessary converging opera- 
tions for perceptual defense will require, if 
possible, the elimination of the stimulus-effect 
hypothesis, just as recent experiments have 
been designed to eliminate the hypothesis of 
the response effect. The experiments of Gold- 
stein (1962), Bootzin and Natsoulas (1965) 
Goldstein, Himmelfarb, and Feder (1962). 
and MacIntosh (1961), not previously diss 
cussed in relation to the stimulus effect, can 
all be interpreted according to Blum’s hy- 
pothesis. For example, the Bootzin and Nat- 
soulas findings of greater accuracy for neu- 
tral than emotional words were interpreted as 


showing perceptual differences, because the 
response effect was shown not to be respons 
ble for the difference. However, the difference 
in accuracy may be due to S’s perceiving 
both kinds of stimuli to the same degree, but 
responding differently as a result of the con 
flict engendered by the emotional words. This 
would not appear in the response bias scores 
because it would work both ways: when emo- 
tional word a was flashed there would be a 


e 
tendency to avoid calling it, as there woul a 
when emotional word b of the pal ie 


flashed. The response bias score refers ue 1 
preferred response, but neither a nor b wou i 
be preferred more than the matched ge 
words, unless they differed significantly in © 
degree of conflict they produced. 

A last point concerns the assertion ™ 
the beginning that a search for conve 
Operations for perceptual defense wil jing 
repaid by the discovery of methods navi 
more general applicability in the study in 
perception, Again to illustrate, the B00 
and Natsoulas (1965) proced 
considered. The stimulus effect which i 
be eliminated as an alternative hypothesis 
true perceptual defense is to be demonstr“ g- 
is not relevant in situations where emoe a 
ality or conflict is not a variable. T” a 
cases, to find that stimuli which vary ae 
Systematic way and are projected fort 
durations yield different accuracies o: ide 
fication, where between-set response bias z 
irrelevant and within-set biases ave 
strably equal, is to find evidence fO" Pad 
ceptual variation. Taylor Rosenfeldt, clu- 
Schulz (1961) would “object to this CM hy. 
sion on the grounds of an alternative |. 


ade at 
rging 


a 


ing 

A rdin? 
pothesis to perceptual variation. oem 
to this hypothesis, two classes Of -itely 


which are shown to differ in how accur 
they are identified with a forced-cholce 
nique such as that of Bootzin and aa 
(1965) might so differ because of tP ti 
tion of perception to response. enol ass- 
may be constant across conditions but ag er 
mg or identifying on the basis of part vio" 
ception may differ between them. “fof 
Rosenfeldt, and Schulz (1961) suga peli- 
purposes of eliminating or reducing t e 
hood of this competing hypothesis, 
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If zari: 

ekaa oe that has been shown to influence 

testes a avior can be shown xot to influence 

‘ented ax lentification when partial cues are pre- 

i ae af a a level, a firmer basis would 

ofthe ee to support a perceptual interpretation 
er [p. 495]. 


Ina forced-choice procedure of the kind used 
thay oe and Natsoulas (1965), it seems 
feat a tl ough it must be shown, that the 
rerent die of the responses to S would 
differents is kind of variation from affecting 
N ial accuracy scores. 
nea has been taken, in the above 
Sheet > pale directed discussion, with re- 
stimulu low the emotional character of the 
the a may have its Negative effects upon 
lons ceptual process itself. Such considera- 
ogi Ke left to the development of psycho- 
Oty: th and perhaps neurophysiological the- 
o A eae the present view is neutral as 
Useful = her, for example, it is scientifically 
Scions o introduce such a concept as “uncon- 
Telatiy pereeption, in order to explain „the 
arly A ailure of perception, per se. Simi- 
À HE stimulus-effect hypothesis must not 
Peni ge as involving necessarily the pro- 
Ypoth a specific mechanism. All that this 
h esis maintains is that stimuli of a cer- 


ain $ ‘ 
iis ee character can have specific 
Syst S of defense or avoidance in the response 


e : i 
ey m, as a result of the conflict or emotion 
Syst Produce, while leaving the perceptual 
“em unperturbed. 
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CONCEPTUAL ORIENTATIONS OF CONTEMPORARY 
SMALL GROUP THEORY* 
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In an attempt to study some of the conceptual and content parameters of 
contemporary small group orientations, 2 classificatory systems were developed 
and applied to a sample of empirical hypotheses derived from 6 such orienta- 
tions. The results of this analysis provide data on the similarities and differences 
between these points of view on 3 dimensions: (a) the size of the social unit(s), 


(b) the social process level(s) with which they deal, and (c) the substantive 
content of the variables which they employ. 


In surveying contemporary conceptualiza- 
tions of small group behavior, one emerges 
with the feeling that there are both simi- 
larities and differences among them. In some 
places, two or more authors seem to be writ- 
ing about the same variables in the same 
way; more importantly, in other instances 
they seem to view similar processes from 
different points of view. In any attempt to 
evaluate findings concerning behavior in 
small groups a difficulty arises in that points 
of agreement and disagreement are often not 
explicit, but rather are obscured by differ- 
ences in terminology, units of analysis, levels 
of aggregation (cf. McClintock, Becker, & 
DeLamater, 1963), etc. 

These similarities and differences in con- 
ceptualization are important because they are 
reflected in many of the critical hypotheses 
found in those studies which form the empiri- 
cal basis of the field of small group behavior. 
Since many of these studies derive their con- 
ceptual orientation from one of several prin- 
cipal points of view, an “objectification” of 
differences in these approaches can illuminate 
considerably the underlying similarities and 
differences in hypotheses, techniques of oper- 
ationalization, and even results extant in 
contemporary small group research, 
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The present paper presents a first cat 
at such “objectification.” It was ba ities 
some understanding of theoretical sim! a ig 
and differences could be gained by exa™ rs 0 
certain conceptual and content paramete rin- 
current research based on each of SIX Pior: 
cipal orientations to small group ae 
Bales Cartwright an r 


> pe 
in general, see Lambert, 1963.) MOT “ino 


cifically, an analysis was made of (0) the 
size of the action units around whic view 
variables employed by these points ° cial 
are conceptualized, (b) the types ci sub” 
processes conceptualized, and (e) 
stantive content of the variables = T 
tations as they appear in empirical ited by 
especially those studies conducted oF © el. 
the theorists as illustrative of their on jew” 
In some ways this analysis can ef ns? 
as a complementary effort to the > Ga 
research programs conducted bY ; n ant 
(1962a, 1962b, 1962c) and pe ne, 
Terauds (1960), The latter have it 
a method for classifying small grouP 
rather 


in terms of their operational properties n pesi?” 


their conceptual/verbal properties «+ ~ 
terms of their substance or content ( 
group structure, etc.), but in terms ° 
data characteristics of variables, i€» 
tional definitions, 
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CONCEPTUAL ORIENTATIONS 


Thus, the present analysis provides a mea- 
Sure of the current conceptual orientations 
in the field of small group behavior on sev- 
: dimensions, that is, the extent to which 
E RES cover the range of action 
A the degree to which they include 
k us types of social processes and sub- 
e variables in their formulations. In 
riton; it indicates the similarities and 
i erences between points of view on these 
oe and thus hopefully increases our 

nceptual understanding of both the orienta- 


ti i 
en and the research which are derived from 
m, 


DATA SELECTION 


ean ea of empirical studies representing 
Eea the six orientations cited above was 
at e on the basis of four criteria. The 
Fas a primary criterion for study selection 
at at it be referenced in the principal 
pce a works written about the conceptual 
cussed Empirical studies which were dis- 
chosen or reprinted | in these works were 
the bee source studies. In each case where 
e a work merely discussed a study, the 
analysi article was used as the basis for 
experi s. Where an insufficient number of 
Work ments were represented in the principal 
Seka ee experimental reports were 
Set of from those referenced in the initial 
studies , experimental studies. The specific 
ih eo in the analysis are indicated 
i 
staan criterion of selection was that all 
Speci. included in the analysis contain (a) a 
etived hypothesis or hypotheses deductively 
Orient from or closely related to the general 
—_ ation, (b) a clear specification of the 
bera nization of and the methods of 
ate lonalizing each variable, and (c) a 
sults. tt and statistical analysis of the re- 
Was - The presence of these characteristics 
diag te necessary to facilitate variable 
Cation, 

spre, hird principle of selection was that 
Oximately 15 tested relationships (hy- 


4 
ihe primary sources for studies were as follows: 
hd (Hare, Borgatta, & Bales, 1955), Cartwright 
Zander (1960), Homans (1950, 1961; Riecken 
Omans, 1954), Moreno (1956, 1960), Newcomb 
61), and Thibaut and Kelley (1961). 
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TABLE 1 


Source STUDIES oF VARIABLES EMPLOYED 
IN THE ANALYSIS 


C tua 
e Source studies 
Bales Bales, 1955, 1956; Bales & Strodtbeck, 


1960; Bales & Slater, 1955; Borgatta 
| & Bales, 1955a, 1955b; Heinicke & 
| Bales, 1953; Talland, 1955. 


| Cohen, 1959; Deutsch & Gerard, 1960; 
Festinger & Thibaut, 1951; Horwitz, 
1960; Hurwitz, Zander, & Hymovitch, 
1960; Pepitone & Kleiner, 1957; Pepi- 
tone & Reichling, 1960; Raven & 
Reitsema, 1960; Schachter, Ellertson, 
McBride, & Gregory, 1960; Stotland 
& Zander, 1958. 


Cartwright 
& Zander 


Homans Homans, 1950, 1961; Reicken & Homans, 


| 
| 
| 
| 1954. 


Moreno Deutschberger, 1947; Kidd, 1960; Mo- 

| reno, 1960; Mouton, Bell, & Blake, 
| 1960; Northway & Detweiler, 1956; 
| Northway & Rooks, 1956; Tagiuri, 
| 1952; Zeleny, 1956. 


Newcomb | Newcomb, 1961. 
| 
Thibaut & | Adams, 1953; Azrin & Lindsley, 1956; 
Kelley Back, 1951; Blau, 1954; Cohen, 1958; 


Dittes & Kelley, 1956; Hoffman, Fes- 
tinger, & Lawrence, 1954; Kelley & 
Lamb, 1957; Kelley & Volkart, 1952; 
| Kounin, Polansky, Biddle, & Finn, 
| 1956; Lipset, Trow, & Coleman, 1956; 
Mills, 1953; Spector, 1956; Thibaut & 
Coules, 1952; Thibaut & Riecken, 1955. 


potheses) be selected from each conceptual 
position. (The only point of view for which 
the optimum number of relationships was not 
obtained was Moreno’s, due to a lack of 
access to studies relevant to it.) An attempt 
was also made to include only one tested 
relationship from each source study. How- 
ever, where only a small number of studies 
were available, more than one relationship 
was sometimes included; this was necessary 1n 
11 of the 55 studies with particular concen- 
tration in the orientations of Cartwright and 
Zander, and Moreno. It should also be pointed 
out that the only source of experiments based 
on Newcomb’s approach was his recent book 
(1961); however, the hypotheses in this 
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TABLE 2 


CLASSIFICATION OF VARIABLES IN HYPOTHESES 
By Size OF SocIaL UNIT 


Dependent variable 
Independent 


variable w | j si 
Individual Dyad Group Total 
Individual 23 | 1 4 28 
Dyad | 3 10 0 13 
Group w j 27 E2 
Total 40 12 31 83 


study cover all major parts of his conceptual 
system. Finally, only those hypotheses which 
were statistically supported (p < .05) by the 
experiment were included in the analysis. 

In the total sample of selected hypotheses, 
4 variables were unclassifiable, and thus 4 re- 
lationships (8 variables) do not appear in the 
subsequent tables. In addition, 11 other vari- 
ables do not appear because they were minor 
and/or control variables in a multiple-variable 
hypothesis. Thus the total number of rela- 
tionships classified and included in the analy- 
sis equals 83; the total number of variables 
initially studied was 185 (95 independent 
90 dependent). : 


ANALYSIS BY SIZE OF SOCIAL UNIT 


The classification of variables was con- 
ducted in two phases. In the first analysis 
each major variable of a given hypothesis 
was classified according to the size of the 
acting social unit. A variable, by definition 
must be capable of taking on different values, 
and thus represents a class or dimension of 
some kind. In the present analysis, if the 
value of a variable could be determined b 
observing only one individual, the ahe 


CLASSIFICATION OF VARI 


was classified at the “individual level”; if 
it was necessary to observe exactly two indi- 
viduals to determine the value, it was clas- 
sified at the “dyad level”; and if three OF 
more individuals had to be observed, H 
variable was classified at the “group leve: 
Thus, even if a variable is represented DY 
the mean, sum, or any other characterization 
of a set of scores, the variable was called: 
(a) individual, if all the scores were obtamè 
from a single individual, for example, a 
individual’s score on an attitude scale, 
dyad, if all the scores were obtained from 
two individuals, for example, degree gi sien 
larity of A’s and B’s attitudes toward X, a" 
(c) group, if all the scores were shiam 
from three or more individuals, for examp Ss 
the attraction of the members of the group 
for the group. RN 
The results of this classification are a 
sented in Table 2. These data show the m a 
size by unit-size relationships for the t a 
sample of hypotheses; thus, the frequency g 
each cell indicates the total number of pN 
potheses which related an independent var 
able at a given unit level to a de 
variable at a given level. The diagonal 
in the table provide an indication °- ijy 
number of hypotheses which conceptu@ ) 
employ a social unit of the same magnit, 
in the independent and dependent Va". a 
Table 3 presents the distribution of 87 
social unit for each variable in each O g 
tion, regardless of whether they We? ples 
ployed in independent or dependent yana ent 
The results indicate that the indeper esis 
and dependent variables of a given BYP? the 
tend to employ an acting social unit ° fall 
same size (i.e, those hypotheses W ae 


enden 
Ps calls 


TABLE 3 


ABLES BY ORIENTATION 


— = AND SIZE or SocraL UNIT ae 

Unit size zs = i pee e 
Bales eat ieee cma al 

p ; gath > 

eae s È Moreno | Newcomb | T hibat, & pe 

Individual 3 Hien 66 
Dyad 2 10 9 | 16 25 
Group 27 1 12 3 75 
Total | 32 9 5 | 9 166 

7 ees a0 26 28 a 


a 


CONCEPTUAL ORIENTATIONS 


in the diagonal cells of Table 2); this is true 
in 72% of the hypotheses studied. It is also 
evident that the hypotheses analyzed pri- 
marily deal with social units at the individual 
and group levels in both independent (70 
out, of 83, or 84%) and dependent variables 
(85% ). In addition, there is a greater tend- 
ency to employ the group as the social unit 
in independent variables than in dependent 
ones (57%), and to deal with the individual 
unit in dependent variables rather than in 
Independent ones (60% ). Dyads are em- 
Ployed about equally often in both types of 
variables, 

Table 3 indicates even more clearly the 
tendency to use variables dealing with indi- 
viduals and groups; 85% of the variables 
Studied employ these two units. Turning to 
the distribution of units by orientations, 
three conceptual systems give approximately 
equal emphasis to individual and group 
Units: Cartwright and Zander, Homans, and 
Moreno, The other three each tend to em- 
Phasize variables dealing with only one of 
re three units. Thus, a high proportion of 
f e variables derived from Bales’ approach 
vr with groups (84%); variables from 
f ara and _Kelley’s orientation tend to 
z er to individuals (57%); and Newcomb 
a slightly more often with dyadic vari- 

es than with individual or group ones 


(46%), 


ANALYSIS BY TYPE or SOCIAL Process 
AND SUBSTANTIVE CONTENT 


a oe Controversies over conceptual 
chology level orientations in social psy- 
appropriateness issues concerning both the 
: of conceptualizing theory 

S social pro a i 

ploying various subsane t “tt 
should be noted that the Present ata 5 
scheme was not designed to resolve the a 
troversies nor to reflect a particiilar are 
of view, but rather to provide a desc ‘ten 
of the positions implied by the six cane iual 
Schemes selected for investigation, rar = 
ample, the fact that the present system Er- 
mits affective and cognitive states to he oe: 
tributed to either individuals or groups eas 
Not meant to imply that the authors neces- 
Sarily consider either usage appropriate or 
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inappropriate. (For a discussion of the prob- 
lem of personification in the social sciences, 
see Allport, 1962, and Nagel, 1961.) 


Social Process Categories 


Individual Level. At this level of social 
process the individual is treated as the unit 
of conceptualization; there is no specification 
of any relationship between a given indi- 
vidual and other individuals or a group. This 
level includes variables conceptualized as 
states, properties, and behaviors of the indi- 
vidual; for example, hunger, weight, IQ, 
anxiety, etc. 

Relational Level. Here, a one-way rela- 
tionship is specified between a given indi- 
vidual and one or more other individuals or 
a group, for example, A’s love for B, A’s 
influence on B, A’s status in relation to B; 
where A is an individual and B is another 
individual or group. This level includes vari- 
ables dealing with the type and nature of 
such relationships. 

Mutual Level. At this level a mutual rela- 
tionship or interdependency is specified be- 
tween an individual and other individual(s). 
This level includes such variables as the 
number of (dis)agreements between A and B, 
mutuality of attraction between A and B, 
intensity of conflict between A and B; where 
A is an individual and B is another individual 
or group. 

Group Level. Here, the group is specified 
as the unit of conceptualization. This level 
includes variables conceptualized as states, 
properties, and behaviors of the group; that 
is, variables that deal with the group as an 
such as the cohesiveness of A, the 


A, the productivity of A. It also in- 
art of the 


entity, 
size of 
cludes those which deal with one p 
group relative to another. 

© One can also describe group relational and 
group mutual levels, that is, the relationship 
of group A to group B, or between groups 
A and B. However, such conceptualizations 
of group variables did not occur among the 
sample of hypotheses selected for the present 
study. 


Substantive Content Categories 


Affect. This type includes all variables 
which are defined and/or conceptualized as 
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oe affective states, namely, emotional | s lions rae 
bore both positive and negative. The i ae 
Mines of such states can be at any level of P 
H Ptualization; for example, John is un- | RARE 
Mae he does(not) like Harry, John and | : 
doesn 9(not) like each (other, the group E|/SS488 
‘Nie ot) like Harry. Motivational variables — 7 Sree 
included in this category. | A 
win enition, This type includes all variables | g 
States are conceptualized as internal cognitive | 2 
i or Processes of an individual or a | F 
up. Variables dealing with internal beliefs E 
varaa en are of this type. Cognitive i 
zation: ¢ may be at any level of conceptuali- - 
oe or example, Harry believes . . ., the Ki & 
P thinks., £ 5 
Titles This type includes variables ie $ 
Soe refer to the overt behavioral mani- 5 M ee ices ceases 
ations of individuals or groups. It includes a BS 
teva motor acts including verbali- 5 3 D 
RA and direct expressions of affect and J gy 
uch soe except for introspective reports. Fa E Q =, 
Ghat may be at any level of con- @ |] 2) es igi ee = 
Btoup = lon; for example, Harry runs, the 4 5 a 
Baa onvenes, a Z gl wtane 
tite oo This type includes all variables = zi 
ie ae attributes of an individual or a z A ey eR ON 
india ag are defined only in terms of the 7 & 2 | 
eet or group’s relationship ta ather Z E E| nin t t 
such ua h or groups. It includes variables s pi 
S ae e individual’s or the group’s role, a ajomas 
Th 7a se cee ge Position, etc. s ee E 
es ana i i 5 ; 
r at the group tem” eS te = F ollie 
. ri N er oe ee 
hi ae by poe i all variables z = | g asong 
'Stribution of number, location, and Ę 2 menn 
NY Jewel fe, parts in a unit. It can be at g Žž g| ntong 
H 5 
a ian are at rene | P 
ste D the distribution of roles in a group, ° jaa 
x In the variables analyzed, however, this Aa enei 
Occurred only at the group level.) 2a TTS 
aes, 
RESULTS gjora 
sla results of the classification of the 
Drog hypotheses in terms of the social ee 
in cess level and content type are presented 2 Saa 4 
hyp able 4. Each cell entry stands for one È HE 4 a 
ah o thesis which related an independent vari- äi z 2 ŠŠ 
tint of the indicated process level and con- 


type to a dependent variable of the 
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TABLE 6 
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Content category 


Orientation | Affect Cognition Behavior Position | Structure 
1Y. DV] Y LV: | DV.) % RV. | DING | % Tie AN ne 

Bales 1 3| 12| +) 2) 9] 2) 0} 38 f | 4 6 
Cartwright & a a 

Zander 3 5 | 25 8 7 47 1 4+ | 16 2 0 6 2 | 
Homans 4 4 | 29 2 2/14| 4 6 36 4| 0 te, 0 
Moreno 1 1|10| 2 1 15 4 3 | 35 2 4 30 1 
Newcomb | 5 5 | 38 2 5 | 27 0 0 0 6) 0 | 23, 0 
Thibaut & Kelley 8| 2)36) 1) 3] M4) 9| 9|32] 3] o | i | 2| 

Total | 22 | 20 | 25 | 19 | 20 | 24 | 11 | 32 | 26 | 18) 5 | 14} 13 

li ! 


Note.—Percent figures indicate the percentage 


D indi the total number of variables from a given orientation 
classified) which fall in the indicated content category. 


indicated level and type. The letter indicates 
the conceptual framework from which the 
hypothesis was derived. The number in the 
lower right-hand corner of each level-level 
cell indicates the number of hypotheses which 
related those two process levels. Marginal 
row (independent) and column (dependent) 
totals indicate the number of variables found 
for each process level and content type, 
summed over all orientations. 

The results indicate that independent and 
dependent variables tend to be conceptualized 
at the same social process level; independent 
variables at the individual level tend to be 
related to dependent variables at the indi- 
vidual level, and mutual to mutual. Thus 
64% of the hypotheses analyzed fall in the 
diagonal cells of the matrix of Table 4. This 
predisposition to conceptualize the independ- 
ent and dependent variables of a given hy- 
pothesized relationship at the same social 
process level characterizes a majority of the 
hypotheses selected from each orientation 
except Moreno’s, 

Table 5 presents a classification of the 
variables by orientation and by level of social 
process. The data indicate that the greatest 
proportion of the variables are conceptualized 
at the group (34%) and the relational 
(30%) levels. A greater proportion of vari- 
ables conceptualized at the individual level 
are treated as dependent variables (64%) 
and a greater proportion of mutual level 


PE ber 
(or of the total num 


jables 
4 e ariab 
variables are treated as independent va 


(58%). 

Table 5 also indicates that ai 
derived from a given conceptual ga soci | 
to emphasize only one or two levels ° geriveð 
process. Thus, 78% of the eye set f 
from Bales’ approach are conceptu? i ~grt 
the group level, 78% of those fror vidua! 


these’ 


wright and Zander’s are at the 1, ve at 
and group levels, 75% from Homans | fro™ 
the relational and group level, 7 /ryidual 
Moreno’s are at the relational and 1" the 
level, and 88% from Newcomb’s a a enc! 
relational and mutual level. THIS, =A yom 
is less marked in the variables daw hes 
Thibaut and Kelley’s system: 649 elatio”? 
variables are at the individual a” N 
levels. rie? 


ego 
Table 6 shows the use of content = jeve 
across orientations and social proc’ 
The results indicate that variables 
alized in terms of overt behavi0! 
frequently employed as dependent 
(74%) than as independent 
whereas a smaller proportion viable je! 
(22%) and structure (32%) va jes: Me 
conceptualized as dependent pane ofte” 
and cognition are treated equ@ ples: 
independent and dependent a diffi 
Table 6 further indicates ize o 
conceptual systems tend to e™P 
ferent types of content, Note 


Bales’ variables fall within the 
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ened categories, 72% of Cartwright and 

650 p fall within affect and cognition, 

654, e omais within affect and behavior, 

ate. nt Morenita within behavior and po- 

ia 5% of Newcomb’s within affect and 

vibe on, and 685% of Thibaut and Kelley’s 
In affect and behavior. 


Discussion 


co these six major orientations 
TR oy of small group behavior, con- 
one i oss Sion | was discovered in the 
flayed ua approaches. The variables em- 
a Pn representative hypotheses differed 
beste . size of the acting unit, the con- 
Pipe evel of social process, and the type 
ki stantive content emphasized. Despite 
‘Sen ee i there do exist some com- 
i endencies. The results indicated a tend- 
hs oe all the hypotheses classified to 
inde € same sized social units (72%) for the 
Oe and dependent variable of a 
tide eae The independent and de- 
ee evels of social process were also 
Both o be at similar levels (649%). Fur- 
: alanis was a tendency to employ 
thie pn ual more often as the acting social 
(sae, ) Jo) and as the level of social process 
Sead ann the dependent side of a hypothe- 
Wate i The latter predisposition is 
Social ne v the historic orientation of 
vidue a ology towards viewing an indi- 
Social ch enaMiO: Hg dependent upon given 
characteristics of the environment. 


In c i 
ompari i i 
of i paring the conceptual orientations 


of findin 


; 4 Ppro 
alized primarily at he oe es 
ess leve 


Which is consistent wit 
T emmo, whereas nE m E 
ee and Zander’s are primarily con a 
zed at either the group or individual hot 
me latter is congruent with Lewis (os) 
th Phasis upon the social behavior of ri 
i. individual and the group, and a lack of 
scoretical concern about the nature of the 
i cial process (relational or mutual) þe- 
ĉen individuals. In contrast, the remaining 
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orientations (Homans, Moreno, Newcomb, 
Thibaut, and Kelley) place rather heavy em- 
phasis upon the dynamics of the interper- 
sonal relationship per se. It should be noted 
that despite repeated assertions by social 
psychologists that the most critical area for 
investigation in small group behavior is the 
reciprocity of the interaction process, only 
those hypotheses stemming from Newcomb’s 
orientation had a substantial number of vari- 
ables conceptualized at the mutual level of 
social process. 

In the selection of substantive variables, 
those hypotheses which derive from the more 
sociological points of view (Bales and 
Moreno) were found to select more variables 
referring to position, structure, and overt be- 
havior than the other orientations. This em- 
phasis may in part be due to the fact that 
it is more appropriate in sociology to talk 
about various characteristics of groups and 
their behavior rather than affect or cognition. 
In both Homans and Thibaut and Kelley one 
observes the heaviest stress upon the substan- 
tive variables of affect and behavior which 
is consistent with their reward-cost (rein- 
forcement) formulations of small group be- 
havior. Finally, the substantive variables 
most characteristic of both Newcomb’s and 
Cartwight and Zander’s conceptual orienta- 
tions were those of affect and cognition which 
reflects their stress upon the importance of 
attitudinal and perceptual variables in social 
behavior. 

It is of interest to note in the present 
analysis those levels of social process which 
are not related frequently in the present set 
of sampled hypotheses. Thus, there are few, 
if any, hypotheses which utilize variables at 
the individual process level as independent, 
and mutual level variables as dependent vari- 
ables and vice versa. A study of research 
ing from more clinically 
theories would probably 
reveal a substantial number of hypotheses 
at these levels. Thus, this would appear to 
be an area in which greater rapprochement 
between the two specialties might be 
achieved. 

Furthermore, there are relatively few 0c- 
casions when variables at the individual 
process level are treated as the independent 


investigations stemm 
oriented ‘personality 
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and at the group level are treated as 
the dependent—that is, what character- 
istics of the individual affect the group? As 
noted previously, there are strong historical 
precedents to state hypotheses in the opposite 
direction, namely, from group to individual 
level. Although a more powerful explanatory 
model could undoubtedly be achieved by em- 
ploying relational or mutual variables as 
independent variables when the group is 
treated as a dependent variable, it is of inter- 
est to note the continuous prominence which 
the individual is afforded in past and con- 
temporary social psychological theory. 

In conclusion, it should be emphasized 
that the present analysis was based upon a 
sample of empirical studies selected to repre- 
sent various conceptual approaches to the 
study of small group behavior. The results 
of this analysis are limited by the sample 
chosen, although the authors feel that this 
sample was adequate and representative, In 
addition, there is often a discrepancy be- 
tween the comprehensiveness of the formula- 
tions of a given conceptualization and the 
range of empirical studies deriving from that 
orientation. Thus, most orientations in social 
psychology conceptually deal with mutual re- 
lationships more frequently than would ap- 
pear to be the case by such an examination 
of the empirical literature, The lack of Te- 
search focused upon the interdependencies or 
mutualities of Interpersonal behavior may re- 
sult from an inability to state precisely the 
conceptual nature of these relationships or 
from present methodological difficulties in 
measuring reciprocal behavior. Finally, it 
should be noted that conceptualization often 
toll open mle of empirically 

i H ay thus be somewhat 
deficient in representing the content of a 
conceptual system. 
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Various possible meanings of “phonetic symbolism” are discussed. Phonetic 
symbolism is distinguished from onomatopoeia. “Elemental” and “structural” 
phonetic symbolism are defined. Elemental phonetic symbolism is discussed 
m terms of 7 hierarchically arranged questions which define “subjective” 
phonetic symbolism (that detected by observers) and “objective” phonetic 
symbolism (overrepresentation of particular sounds in words of particular 
connotations in natural languages). Experimental and empirical evidence 
relevant to each question are discussed, and it is concluded that both subjective 
and objective elemental phonetic symbolism are real phenomena, but that the 
patterns of symbolism are unrelated in historically unrelated languages. The 
feedback theory of phonetic symbolism (Taylor, 1963) is considered in greater 
detail than heretofore. The feedback theory asserts that if speakers of a 
language show subjective phonetic symbolism, then the language will come to 
show the same pattern of objective phonetic symbolism, and that the pattern 
of objective phonetic symbolism in the language similarly influences the de- 
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velopment of subjective phonetic symbolism in the speakers. 


according to the hypothesis of Universal 
y etic Symbolism (UPS), sounds have 
all ne connotations which are the same in 
| Mba Taylor (1963), rejecting this 
that nis as no longer tenable, contended 
ly on ee symbolism was strong in prob- 
h Aa languages, but that the actual con- 
Saes of sounds differed among unrelated 
ne ar ee further proposed a feedback 
bolis, © anguage-dependent phonetic sym- 
mon ciss oe has criticized both Taylor’s 
er Bers, aa the UPS hypothesis and 
ay be hough Weiss’ specific criticisms 
ay answered, they exemplify 
eanings ef es among the different 
his Daner ee term phonetic symbolism. 
ication of these usage ification and lassi- 
Tene ges. In : 
‘age: anew the See oe 
A of phonetic Symbolism, and pr 

e feedback theory in extended DS PS 


Wuar Is Puonetic SYMBOLISM? 
h P . 
tonetic Symbolism and Onomatopoeia 


„Several studies, following Sapir (1929) 
€ required subjects to judge nonsense 
i s 

i The opinions expressed in this paper are those 

it authors and do not necessarily represent the 

‘ten of the Canadian Department of National 

ce, 
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syllables as if they referred to various at- 
tributes, such as size, brightness, pleasantness, 
and so forth. The best known result of these 
experiments is that for English speakers the 
vowel influences the “feel” of a nonsense 
syllable in respect to size. High vowels, such 
as /ī/, tend to give a feel of smallness, while 
lower vowels tend to give a feel of greater 
size. The favorite rationale for the effect of 
vowels on the size feel of a nonsense syllable 
has been that small objects tend to make 
higher pitched sounds than do larger objects. 
High-pitched vowels, therefore, seem ap- 
propriate for small objects. The sound sym- 
bolism is, according to this rationale, a subtle 
form of onomatopoeia. 

Onomatopoeia is defined in dictionaries as 
the generation of words in imitation of the 
sounds made by the thing or action desig- 
nated. The relationship in the literature be- 
tween phonetic symbolism and onomatopoeia 
can be at best described as confusing. At one 
extreme, phonetic symbolism was discussed 
under the heading onomatopoeia without ever 
being labeled as phonetic symbolism by 
Osgood (1965), as though phonetic sym- 
bolism is onomatopoeia. At the other extreme, 
Brown (1958) clearly distinguished the two 
phenomena: 


We all know about onomatopoeia—the imitation of 
non-speech sound in specch. But this is not onoma- 
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topoeia. Here speech sounds suggest spatial and 
visual dimensions, size and brightness [p. 114]. 


Normally speaking, one of the more strik- 
ing characteristics of natural sounds is 
rhythm or lack of rhythm. Hence, onomato- 
poetic words tend to be rhythmic or not, de- 
pending on the thing designated. It is, a priori, 
reasonable to suppose that the nonrhythmic 
symbolism that connects size with vowel 
sounds is also a form of onomatopoeia. How- 
ever, it is not so reasonable to suppose that 
onomatopoeia generates the affinities that have 
been found between consonants and size, or 
between vowels and, say, pleasantness. Neither 
is it reasonable to suppose that the same ono- 
matopoetic effect makes Tamil and Japanese 
speakers feel /1/ to be appropriate to large 
objects and /U/ to small, unless in these 
countries large, instead of small, objects emit 
high-pitched sounds. 


Then, there are the results of Eberhardt’s (1940) 
experiments with deaf children to be accounted for. 
Eberhardt, using the Sapir-Newman procedure with 
English reading deaf children obtained similar order- 
ings of English vowels as for hearing (English- 
speaking) subjects. The deaf subjects obviously 
could not have based their judgment on the tonal 
quality of the sounds [Taylor, 1963, p. 206]. 


There is obviously a sound symbolism that 
is not related to onomatopoeia. This is the 
effect called “phonetic symbolism,” 


Elemental and Structural Phonetic Symbolism 


Thus far, we have considered both phonetic 
symbolism and onomatopoeia as if their ef- 
fects depended only on the sounds in the 
words or syllables, without consideration of 
the order or relationship of the sounds. In 
Sapir’s material the structural pattern C-C 
(initial consonant and final consonant) of 
“m-I? remained constant while one sound 
element, the middle vowel, was varied, so 


that we can safely attribute the varying size 
connotations of the s 


yllables to the varying 
sound element. 

It is difficult to determine whether or not 
elemental onomatopoeia exists. The words 
that seem particularly appropriate for de- 
scription of things with clear acoustic con- 
comitants do not necessarily contain sounds 
like those they describe. Consider the low 
vowels in “cock-a-doodle-doo” and compare 
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them to the high screech that the bag i 
supposed to imitate. In other mg oe oh dit- 
same screech is expressed with entirely 
ferent sounds. te 
Experimentally, one cannot dema ae 
onomatopoeia of any kind, because sym- 
ever-present possibility of pe in e 
bolism. By chance, or for cause, the holism 
appropriate by reason of phonetic syn asil 
may be like the sounds described. opri- 
analysis would suggest them to be Be os 
ate by reason of onomatopoeia. One oats it. 
illustrate onomatopoeia, never demons 


: le- 
2 5 n of Os, 
While experimental demonstration topoeit 


ation 0 
i ars 
structural phonetic symbolism appe 
Structural piane 
bolism may be distinguised from § 


for *, 
referen 


structure? So far as we are aware, muestiO a 

have been directly aimed at these a i 
We shall enquire in this paP Ss m olism | 

questions of elemental phonetic Toil a 

though the other questions will, ide 0” th 

to time, be raised where they int" 

main topic of enquiry. 
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+. SY 
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The effects of elemental phor gunds od 
bolism are nominally due to t 


|e 80" oin 
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ee 
words. However, as has often P 
out 


= 
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is 

ible sounds ? 

... the total number of possible Indee 1 act uv? 
excess of those actually in use- di 


se 

5 e 

enced phonetician should have a to ob! 
inventing sounds that are see oan aft 
investigation [Sapir, 1921, pp- a eat ptt 


h 
re 
Furthermore, the sounds ae ound’ 
far fewer in number than t 

are produced: 


Under normal circumstances, in the 
people do not distinguish differen pon? 
lophones . | , . Consequently the ding of (5* 
small to be a unit in the enco“ ie js 
Process, implying that the phone 

1965, p. 62], 
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Ma only do listeners fail to hear fairly large 
cifferencos in spoken sounds, they also hear 
Onexistent differences between two identical 
th te in different contexts (Broadbent & 
os) 1960; Ladefoged & Broadbent, 
All these failures of hearing imply that 
i natural unit for analysis of elemental 
o ieii, symbolism is the phoneme. All stud- 
lanaa acoustic stimulation, save possibly 
E of Miron (1960, 1961), have indiscrimi- 
ately allowed allophonic variation within 
tile Presented phonemes. Similarly, studies 
'§ Written materials have of necessity used 
it Onemes as the units of analysis. In a sense, 
Would be better to talk of “phonemic” 
mbolism, to allay the suspicion that we 
ight actually be enquiring into the sym- 
olism of speech sounds as such. 


Py ; ; ; 
; oblems in Comparing Phonetic Symbolism 
cross Languages - 


oi Miis that belong to a single phoneme in 
tes anguage may be divided among two or 
tea phonemes in another. Sounds that are 
me emically valid in one language may not 
R occur in another, It is impossible to 
in e phonemes of one language in a 
Pa 7 meaningful way onto those of an- 
tail : hough phonemes do not agree in de- 
cma or languages, the main features of 
PA S gee match well enough across 
baa Ai at they may be meaningfully 
Phone r some purposes as the same 
zy me, The English /t/ and the Korean 
sais oh include the same roster of 
stopped cine 1S an unvoiced apico-dental 
wold oun nis Most allophones of / cf 

acceptable variants of /t/ 


most language pairs i 
Kinema, alia Neude many 
Since the phoneme library of o 
does not match that of another language, į 
Ag ‘Meaningless to enquire about dong ta. 
Siversal phonetic symbolism. However, it A 
if 0 im to consider whether phonetic sym- 
S ‘sm gives to matched phonemes the same 
t €l in both languages. It is also meaningful 
ask questions in which the unit of analysis 


ne language 
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is more gross than the phoneme. For example, 
the results of Sapir (1929), quoted above, 
suggest the generalization that low vowels 
give large feel to nonsense syllables. Such a 
generalization gives rise to a valid question in 
any language that uses vowels, without imply- 
ing a vowel to vowel correspondence among 
languages. Phonetic symbolism could in prin- 
ciple be universal in terms of generalizations 
of this type, even though direct phoneme 
matches are not possible over all languages. 


Phonetic Symbolism Felt by Observers, and 
Phonetic Symbolism Existing in 
Natural Languages 


One question of phonetic symbolism is 
whether some sounds feel appropriate to 
certain concepts even in nonsense syllables. 
Sapir’s finding that /t/ in mr seems ap- 
propriate to the concept of “smallness” sup- 
plies a partial answer. A separate question is: 
“Are some sounds found proportionately 
more in words of a certain meaning than in 
words of other meanings?” For example, 
Jespersen (1922) noted the frequent occur- 
rence of the letter ¿ in English words with 
the connotation of “small,” and of x in words 
with the connotation of “contemptuous.” 
These two questions can be asked, and an- 
swered, independently of each other, although 
even recently Miron (1960) and Weiss 
(1964) discuss these two versions of phonetic 
symbolism as if they were logically depend- 
ent. But there is no a priori reason why the 
English language should use 7 more among 
small than large words, even if English 
speakers find that /1/ gives nonsense words 
a small feel. 

If both subjective phonetic symbolism 
(symbolism felt by observers) and objective 
phonetic symbolism (based on the distribu- 
tion of sounds in a language) are demon- 
strated, then we can ask: “Is the sound felt 
to be appropriate to certain concepts in non- 
sense syllables also found in real words of 
the same concept more often than chance 
would indicate?” For example, /1/ is shown 
by Sapir to be appropriate to “smallness 
in nonsense syllables and is noted by Jepersen 
to occur frequently in “small” English words. 
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TABLE 1 


SPECIFIC EXAMPLES OF SEVEN QUESTIONS OF ELEMENTAL PHONETIC SYMBOLISM 


ivi TAA i ations 
A. Association felt by individuals between individual sounds and their symbolic connotatio: 
a in arbitrary contexts (subjective phonetic symbolism). 


——— 


á Me as in MIK, 
1 individual Does John Smith always feel that /1/ in a nonsense syllable, as ! 
. An 


makes the syllable sound small? 


2. Speakers of one language com- 


oer a i > 
Do individuals speaking the same language agree on (1)? 
munity 


f Rai d small? 
Do all English-speaking people agree that /1/ in mıx makes it soun 
speakers 
3. Speakers of unrelated languages Do individuals speaking unrelated languages agree on (1)? Dosp 


R i < kes it soune 
of Japanese, Korean, Tamil, etc., all agree that /1/ in aK make 
small? 


ee 


bani x r é z ages 
B. Frequent occurrence of individual sounds in words of certain concepts in natural languag 
(objective phonetic symbolism). 


oe ror ds 
Foch . easant’”? Wor 
4. Within a language In English, is /A/ found more often than chance in “unpleasat 


such as Grumble? 


lorean, 
ğ Da Yh se, Kor 
5. Across unrelated languages Is the same /A/ found too often in unpleasant words in Japanese, 


Tamil, etc.? 


C. Correspondence between (A) and (B), or subjective and objective phonetic symbolism. 


also 
6. Within a language 


Is /1/, which is felt by En 


S mall, 
glish speakers to make arx sound $ 
found too often in “small” 


words in English? 


~ 


Jso 
oe all, % 
. Across unrelated languages Is /1/, which is felt by English speakers to make MIK sound sma" 


; mo mil, ete 
found too often in “small” words in Japanese, Korean, Tamil, 


° y 
: ; ' -ticula 
Seven Questions of Phonetic Symbolism 4. In a single language, do ie 


z rds 4° 

When subjective phonetic symbolism and phonemes occur more often in M i i 

objective (language content) phonetic sym- SOciated with certain concepts ate par; 

bolism are considered as independent sub- Words of the language as a W net 
jects of enquiry, there ar 


e seven questions 


a particular language show objective P 
the generality of 


symbolism?) + onlay 
5. Do unrelated languages P 

same objective phonetic symbolism m alist 
6. Does the subjective phonetic ie 

displayed by speakers of one langue of 

the same pattern as the objective 


that may be asked about 
phonetic symbolism: 

1. Does a particular individual 
sistent feel 
containing 


find a con- 
in words or nonsense syllables 
certain phonemes when these 
words are judged in respect of certain at- 
tributes? (Can subjective phonetic symbol- 


symbolism found in that language? äl is 

ism be demonstrated within an individual?) 7. Does the subjective phonetic a anc 

2. Can consistent subjective phonetic sym- displayed by speakers of one pe eos pone, 

bolism be demonstrated across individuals of the same pattern as the objecti? gual y 

a single language community? symbolism found in an unrelated 187 s Wg 
3. Can consistent subjective phonetic sym- 


bolism be demonstrated across individuals 
speaking unrelated languages? 


tion? „ye 
The import of these general ae isp! 
be clarified by the specific examP 
in Table 1, 
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fi SUBJECTIVE OBJECTIVE 


PHONETIC SYMBOLISM PHONETIC SYMBOLISM 


4 


Within 
an Individual 


4 
x 2 4 
Between Among 
Speakers of the Words of 
language A language A 


6 


Between 
Speakers of 
language A 
1 and 
t the Words of 
language A 
: 
Between Between 
Speakers of the Words of 
language A language A 
and and | 
Speakers of the Words of 
unrelated unrelated UPS 
language B language B 
7 | 


Between 
Speakers of 
language A 
and 
the Words of 
unrelated 


language B 4 


es Logical relationships among the seven questions of elemental phonetic symbolism. 
lis tev gestions axe, CORDEn by a line, a yes answer to the lower question implies a yes 
answer to the higher, and a no answer to the higher implies a no answer to the lower.) 
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The questions form a hierarchic tree as in 
Figure 1. Three questions (1-3) deal with 
successively more general questions of sub- 
jective phonetic symbolism. Answers to any 
of these questions imply answers to others. 
For example, if subjective phonetic symbol- 
ism could not be demonstrated in any in- 
dividual, then it would be meaningless to 
ask whether it had the same pattern for in- 
dividuals speaking unrelated languages. Two 
questions (4 and 5) deal with objective 
phonetic symbolism, and they are related to 
each other in the same way as Questions 2 
and 3. Two questions (6 and 7) connect 
subjective and objective phonetic symbolism. 
Question 7 is meaningless if any one of Ques- 
tions 3, 5, and 6 is answered negatively. 
Similarly, a positive answer to Question 7 
implies a positive answer to all the other 
six questions. 

The seven questions, together with the 
phonetic symbolism-onomatopoeia and the 
elemental-structural dichotomies, serve to de- 
fine some of the meanings of what we take 
to be phonetic symbolism. 

We shall discuss the evidence regarding 
elemental phonetic symbolism in terms of 
each question separately. 


EVIDENCE ON ELEMENTAL PHONETIC 
SYMBOLISM 


Much of the evidence has been reviewed 
by Taylor (1963), but without the present 
classification scheme. A new approach to 
some of the evidence and inclusion 
published evidence are required bot 
new classification scheme, and 
pearance of Weiss’ (1964) 
earlier review. We shall c 
which we or other author: 
relevant to the seven quest! 
phonetic symbolism. Evidence is considered 
In respect of each question in turn, though 
because of the logical relationships among 
the questions, clear answers to some questions 
provide evidence for the discussion of other 
questions. 


of newly 
h by the 
by the ap- 
criticisms of the 
onsider evidence 
S suppose to be 
ions of elemental 


Subjective Phonetic Symbolism 


QUESTION 1: Can subjective phonetic 
symbolism be demonstrated for a single in- 
dividual? 
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Question 1 has not been pagers 
studied. However, the very clear ponid- 
answer to Question 2 confirms that ind! - 
uals do demonstrate subjective phonetic sy 
bolism. 3 

Question 2: Do individuals speaking 1 
same language show the same pattern of 
jective phonetic symbolism? i of 

This question has been the § ome 
several studies (Dagiri, 1958; a ie lor 
1961; Newman, 1933; Sapir, 1929) "ne 
& Taylor, 1962), all of which use ali- 
analytic method to determine the po o 
tions of individual phonemes, and CA G 
which produced a decisive positive ité 
All the studies included the size ee 
Newman added brightness, Taylor an jeasant- 
included movement, warmth, and P ibutes 
ness, while Miron used 15 different ce is 
All studies except Dagiri’s included 
speakers, while Dagiri, Miron, an wit 
and Taylor used Japanese speakers age. 
different degrees of English kaom and 
Taylor and Taylor also used TAM an 
Korean speakers. In the five l aa en 
quoted, there have been 51 ianey con- 
tests of the consistency on aier sym” 
ceptual scales of subjective phone n post” 
bolism across subjects, All have show eakin’ 
tive correlations across speakers a ey 
the same language, and only tw® 
sibly three, conceptual scales have 
give strong correlations (Mirom a 
speakers, “heavy-light” and °°? 
Taylor and Taylor, Tamil speakers, „oy the 
ment”). This overwhelming support sugs' 
effectiveness of phonetic symbolisn ature 
that the symbolism is a pervasive ete n 
language behavior, It is not haa spjec™ 
this or that attribute, or to traine ang 
or to speakers of this or that a (A 
Wherever we look, we find elemé 
jective phonetic symbolism. ° 

ieee 3: os individuals sein 
unrelated languages agree on 
tions of a given sound? a, 8 
? After this manuscript was complet oof 
(1965) kindly sent a prepublicatio® P'he aM ys 
article in which he used a technique Pi fI on 
method described for Question 75 oles 0 ed OH 
arbitrary sound complexes formed the pe Pe 
ment scales, with meaningful words vo tbe 
the scales. The results again suppor 
answer to Question 2. 
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Question 3 has been attacked directly by 
Miron (1960, 1961), Davis (1961), and 
Taylor and Taylor (1962). Weiss (1964) 
Presents the Miron and Davis experiments 
as contradicting Taylor and Taylor’s finding 
that speakers of different languages do not 
agree. We do not agree that there is any 
Contradiction among the results of the three 
Studies, 


__ 1. Davis asked English- and Swahili-speak- 
ng children to match the words TAKETE and 
ULoomu, nonsense in each language, to two 
drawings, Both groups of children agreed 
that TAKETE went with the angular drawing. 

For several reasons, this result can carry 
Very little weight in a discussion of phonetic 
Symbolism. First, only one word pair was used, 
So that the chance of agreement between 
Stoups was .5 a priori, if each group showed 
Internal agreement. Secondly, TAKETE and 
ULoomu differ not only in their phonetic ele- 
ments, but also in their structure (CVCVCV 
Versus VCVCV); and hence in their rhythmic 
effects. The results may be valid, but one 
cannot conclude unequivocally that the ele- 
Mental phonetic factor was responsible for 
the results. Thirdly, the words are almost 
Onomatopoetic (probably this is why the two 
Words were selected): TAKETE is similar to 
the sound made by a small angular object 
bouncing around, while uLoomu is less simi- 
lar, but surely suggestive of a soft or liquid 
mass moving slowly. 

2. Miron (1960, 1961) and Taylor and 
Taylor (1962) used very similar techniques. 
Miron asked English and Japanese speakers 
uien to spoken CVC syllables, mostly 
Pie se, and to judge the syllables on 15 

s of meaning, He found an overall av 
age correlation of is 
of .55 bet 

and the Japanese | A ween the English 
was highly significant ee hg correlation 

> enabling him to con- 
clude that there was considerable communal 
ity of symbolism between the groups. i 

Like Miron, Taylor and Taylor presented 
CVC syllables, mostly nonsense, to speakers 
of English, Japanese, Korean, and Tamil, and 
required them to judge the syllables on four 
scales of meaning. Their syllables were 
written with the appropriate alphabets or 
Syllabaries. They computed the symbolic 
Value on each conceptual scale of each con- 


sonant in the initial and final positions sepa- 
rately, and of the vowels, and correlated the 
values across pairs of languages. They found 
correlations of about .24 between English and 
each other languages (p < .05). Had they, 
like Miron, used only English and Japanese 
they also would have concluded that there 
was a common factor in phonetic symbolism. 
However, they found no significant correlation 
between pairs of languages not including 
English. They attributed the positive cor- 
relations between English and each other 
language to the fact that subjects were not 
truly monolingual, having been exposed in 
various ways to English (United States mili- 
tary and civilian personnel frequent Tokyo 
and Seoul to such an extent that some shop 
signs are in English in each city; English 
is an official language in India, and English 
is a school subject in all three countries.) 
Where there was no such direct exposure, 
between Tamil and Korean, Japanese and 
Tamil, or Japanese and Korean, the correla- 
tions were essentially zero. Thus, unlike 
Miron, Taylor and Taylor concluded that 
there was no commonality of symbolism 
among the languages, and that the strong 
phonetic symbolism displayed by each group 
must be language-dependent. 

The detectable effect of minor contamina- 
tion by English exposure of Japanese children 
in Tokyo must raise suspicions about Miron’s 
higher correlations between Japanese and 
English scores. Miron used Japanese aca- 
demics who had, on the average, been in 
the United States more than a year and 
hence had been exposed far more strongly to 
English than had Taylor and Taylor’s sub- 
jects. The two studies considered together 
do not contradict one another. They do sug- 
gest that further studies of subjective pho- 
netic symbolism must control the degree to 
which subjects have been exposed to lan- 
guages other than their own, especially to 
other test languages. 

The answer to Question 3 seems to be 
“no.” 


Auditory versus Visual Presentation of Test 
Material 


The question of presentation method is 
pertinent for Question 7, but arises for the 
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first time under Question 3. It will be con- 
sidered here only, for both types of study. 
1. Auditorily presented real words used in 
word-matching experiments are usually pro- 
nounced by native speakers and tape-re- 
corded. Real words have meanings, and these 
meanings may influence the native speakers 
who pronounce the words. According to 
Eberhardt (1940), who used deaf subjects, 
if the deaf were unaware of the meanings of 
TAP and POUND, they spoke the two words 
with little difference, but once they learned 
the meanings, TAP became lighter and softer, 
while pounn took on a “richer resonance.” 
In addition to the meanings of the words, 
the speaker’s own susceptibility to the sym- 
bolic connotations of words or nonsense syl- 
lables may influence his way of pronouncing 
the words. Jespersen (1922) observed that we 
have a tendency to lengthen and strengthen 
single sounds under the influence of strong 
feeling or in order to intensify the effect of 
the spoken word; thus, in “extremely LONG” 
the vowel O may be lengthened. As a speaker 
is not usually aware of these influences or of 
the purpose of the experiment (as is usually 
the case in word-matching experiments), he 
would not be expected to eliminate them. 
There have been two experiments in which 
different methods of presentation were di- 
rectly compared. Both were word-matching 
studies (see Question 7, below) and He 
results are therefore to be interpreted with 
caution. Brown, Black, and Horowitz (1955) 
compared two methods: visual alone and 
audio-visual combined. They concluded that 
the two methods Produced similar results, 
re ene oa sate) items in three 
guages ranging from .55 to .61 
one ied ee fhe visual-alone 
g s verall performance slightly 
more successful than the audio-visual com- 
bined group. Brackbill and Little (1957) 
compared three methods: auditory, visual 
and audio-visual. The three methods differed 
slightly (p<.05) in producing accurate 
matching, with the audio-visual and visual 
groups being superior to the audio group. In 
the English-foreign languages comparison, the 
visual group showed a tendency to a higher 
percentage of correct guesses; but in the 
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foreign-foreign language matches, the op- 
posite seemed to hold. : 

Ertel and Dorst (1956) used only auditory 
presentation, and suggested in discussion of 
their positive results that expressive voce 
quality of speakers could have influenced the 
subjects’ responses. 

2. Auditorily presented nonsense syllables 
are not always totally nonsense. They can be 
assigned association values and are meaning- 
ful in various degrees. Their associations May 
influence the pronouncer of the syllables. 
When the pronouncer is also the investigator 
as were Sapir (1929) and Newman (1933); 
the investigator’s own hypothesis on phonetic 
symbolism also may influence the expressive 
quality of pronunciation, Newman showe 
that the symbolic value of accenting te 
vowel is heavily in favor of largeness ner 
English speakers. it 

When several languages are involved, p 
becomes almost impossible to prone 
sounds that will be equally valid phoner™ 
ically in all the languages. The pronounce 
can entirely disregard phonemic systems ry 
the languages and produce some Se be 
sounds, but these sounds may turn out to A 
closer to phonemes in some languages tha” ly 
others. In Miron’s (1960, 1961) study: ony 


two languages were involved. Even ait 
trained phonetician had a delicate a" iy 


cult task of attempting a compromis se 
producing sounds so that the prono" gn- 
sounds would not favor either of mi at- 
guages, Japanese or English. What mi 


o 
tempt at a compromise achieved is 
know. eig 
3. Visually presented real words of forged 
languages are usually spelled in Rom ject“ 
alphabets for the English-speaking Shi cat 
Some languages have sound systems pisma in 
be more closely approximated than ° 


such a Romanization attempt. ires pi 
One of the most conspicuous p c a 
real words is visual length. An ao nf 
tempt has been made by Brown, su? 
Horowitz (1955) to determine if tc nee 
length could have influenced the i i i 
By examining the data, they i ngt ie 
subjects did not rely on visual ig OM set 
match foreign words to English WTCE. . af! 


other hand, Brackbill and Little (19° 


Á 
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examining their data for the factor of visual 
length and other factors such as vowels, con- 
Sonant types, spacing of compound words, 
and connotation; found that all these factors 
markedly influenced the agreement among 
the subjects. As a consequence, Brown et al. 
concluded that universal phonetic symbolism 
existed, while Brackbill and Little concluded 
contrary to the hypothesis. 

4. Visually presented nonsense syllables 
have not, as yet, been widely used. So far 
apparently only Taylor and Taylor (1962) 
have employed written stimuli in an analyti- 
cal study. For their experiment, Taylor and 
Taylor prepared the written stimulus syllables 
in four different languages using the alphabets 
and syllabaries of the languages, 

One of the dimensions on which the syl- 
lables were judged was size, and visual length 
Was investigated in one of the preliminary 
experiments with English-speaking subjects. 
Syllables written in a longer form were 
Judged as having “bigger” connotations. How- 
ever, visual length is easy to control, and in 

he main experiment it was equalized in all 
the languages. 

Even after visual length has been taken 
Care of, there remains the differences in visual 
forms of different letters in one language and 
of phonemes under comparison in different 
languages. Could these differences in visual 
forms of phonemes have influenced subjects’ 
Judgments of symbolic connotations of 
Phonemes? One can never unconfound the 
effects due to pure phonemes and possible 
effects due to these visual forms of phonemes, 
as long as the material is visually presented. 

Bs Taylor and Taylor’s graphically 
resented study is compared with the au- 


sounds in com 7 
rather good ine This ve seem dc) 
5 - this observation sug- 
gests that perhaps the two methods of pre- 
sentation do not differ too markedly, 

It is hardly worthwhile to generalize from 
the results of comparisons involving real 
Words to comparisons involving nonsense 
Syllables. For one thing, the results from real 
Words are conflicting; and secondly, the con- 
dition of comparison is somewhat different in 
the studies involving two types of test ma- 


terial. Unlike nonsense syllables which can 
be altered in a controlled way, the numerous 
features of real words cannot be controlled 
before a study but must be examined, often 
haphazardly, after the study is completed. 


Objective Phonetic Symbolism 


QUESTION 4: Does a particular language 
show objective phonetic symbolism? 

Jespersen (1922) pointed out that in many 
Indo-European languages /1/ is used in a 
high proportion of the words connoting small- 
ness, and //\/ (as in Bur in English) in 
words with an undesirable or contemptuous 
connotation. This is an intuitive affirmative 
answer to Question 4 in respect to English 
and other Indo-European languages. House- 
holder (1946) concurred in respect to //A/, 
but his technique has been called “dubious” 
(Miron, 1960). Marchand (1958, 1959) has 
extended to over a hundred the list of con- 
cept-tied phonemes and phoneme clusters in 
English, but the technique remains subjective. 
With presently available computing equip- 
ment, and the developing apparatus of ma- 
chine translation, it might be possible to give 
a more satisfactory direct answer to Ques- 
tion 4, but for the present we must be satis- 
fied with a tentative “yes.” 

Question 5: Do unrelated languages dis- 
play the same objective phonetic symbolism? 

Tf Questions 2 and 4 are answered affirma- 
tively, yes answers are in principle possible 
to Questions 5 and 6. No evidence has been 
produced relevant to Question 5 because 
languages unrelated to English have not been 
examined in the same detail as have the Indo- 
European languages. 


Correspondence between Subjective and 
Objective Phonetic Symbolism 


QUESTION 6: Is the subjective phonetic 
symbolism of a language community reflected 
in words of that language? 

Question 6 has been considered by New- 
man (1933), Berko and Brown (cited by 
Brown, 1958), and Taylor (1963). Newman 
produced a list of words having connotations 
of smallness and of largeness, and examined 
all the sounds of which they were composed 
in the light of his analytical determinations 
of their symbolic size values. He concluded 
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that the subjective phonetic symbolism of 
his observers was not reflected in the words 
of English. Berko and Brown considered the 
frequency of usage of words, and agreed with 
Newman. 

Taylor used the result of Taylor and 
Taylor (1962) that the symbolic value of a 
sound depends to some extent on its position 
in the word, and used their values for the 
size symbolism of the initial consonants in 
testing Newman’s list. She concluded that 
“small” consonants were more likely than 
“large” consonants to start “small” words, 
and vice versa for large words. Weiss criti- 
cized this finding on the grounds that the 
words in the list might have been specially 
selected. The criticism is not valid. To make 
the list, Newman found all the words in 
Roget’s Thesaurus under the headings “Great- 
ness, Smallness, Size, and Littleness” and 
struck out “the repetitions, derivatives and 
phrases.” Eleven judges were asked to assess 
the 500 words remaining, and a majority 
decision was taken as to whether each word 
belonged in either list and, if so, in which list 
it belonged. Selection on a phonetic basis 
seems unlikely. 

Taylor originally considered for illustrative 
purposes only the initial consonants in New- 
man’s list. Since then, we have considered 
the first vowel in Newman’s initial consonant 
words, using the vowel scores from Taylor 
and Taylors CVC syllables. According to 
these scores, /1/ and /e/ are the small vowels 
/u/ and /0/ the large. Of the words in 
Newman’s list having small vowels as their 
first, 78 are “small” and 37 “large.” Of the 
ia having large vowels as their first, 9 

? « 
ran m ep Te ei 
atio. ore marked than that for 
the initial consonants, and is highly signi 
3 gnifi- 
cant (p < .002). This result again suggests 
an affirmative answer to Question 6 
far as the English language is eine tee 
and therefore supports the intuitive affirma. 
tive answer given by Jespersen (1922) 
Householder (1946), and Marchand (1953, 
1959) to Question 4. ? 

QUESTION 7: Does the subjective phonetic 
symbolism displayed by speakers of one lan- 
guage have the same pattern as the objective 
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phonetic symbolism found in an unrelated 
language? 

Question 7 is in a sense the “grand ques- 
tion” of UPS. By implication it asks whether 
all observers and all languages share common 
connotations for given sounds. In relation 
with other questions, positive answers to 
Questions 1, 2, 3, 4, 5, and 6 are necessary; 
and sufficient for a yes answer to Question 7. 
However, Question 5 has no answer in the 
literature to date, and we have conclude 
that a no answer must be given to Question 
3. Question 7 has been directly studied using 
variants of word-matching technique, mostly 
using English-speaking subjects. 


Word-Matching Experiments 


In a standard form of word-matching e% 
periments, speakers of a given languas® say 
English, are asked to choose the ee 
translations into their own language ©. ot 
foreign word pairs presented. In a gat 
Ertel and Dorst (1965) asked German i p 
jects to designate which member of @ a 
word pair belonged to one of the two opPerds 
concepts that were defined by a set O a 
in German. The same set of words Let" 
concepts were translated into foreign im 
to be judged. 

Most of Weiss’ criticisms are or 
Taylor’s interpretation of the standard “pen 
matching studies. After reviewing tP? with 
available word-matching studies, some sults) 
Positive and some with negative ise re 
Taylor noted that the spurious post edu 
sults were probably due to proci pter- 
anomalies, Weiss takes issue with t - 
pretation, suggesting that perhaps P gepet 
results were due to inappropriate l 
mental conditions. 

We recapitulate and expand here 
objections to the word-matching in 

l. Translation bias: Taylors me poi 
. . p 
Jection to word-matching as 2 5i ent 
applies to all word-matching studies ‘lv n 
in the literature, as long as they a ba" ja 
words in foreign languages: someb? Y aoe 
decide what words are appropria‘? inv 


directed A 


. F 
tions of what. Tf the translator © ious l 
with the experiment, he may COP? Rose” ic 
unconsciously bias his selections: mp” 


(e.g., 1963) has documented ma” 
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and experimental cases in which such experi- 
menter bias influenced the results even in 
fairly strictly controlled conditions. The 
Tsuru and Fries (1933) experiment is sub- 
Ject to this criticism, Tsuru having translated 
the test words himself, as is that of Maltz- 
man, Morrisett, and Brooks (1956) who used 
Tsurw’s list. 


_An obvious but naive control against this 
las is to use a translator who is not aware 
of the purpose of the experiment. This con- 
trol has been adopted in most of the word- 
Matching studies since Allport (1935) intro- 
duced the service of a translator for preparing 
Stimulus material. Unfortunately, this “con- 
trol” increases the likelihood of a bias in 
favor of correct matching, 


There seldom exists one and only one foreign equiva- 
lent to an English test word. In the process of trans- 
lating English test words, a translator .. . could 
choose the foreign equivalent that more resembles 
or clang-associates with the given English word. For 
ample, in translating orp into a Japanese equiva- 
Ha a translator may choose orra instead of 
OSHITOTTA because o in orp makes him clang- 
ieee to a Japanese word that also has a 
Binning o [Taylor, 1963, p. 202]. 


Besides, OITA is nearer in length to orp 
han TOSHITOTTA, which may also bias the 


pelection in favor of orra. Tsuru and Fries’ 
Ist did in fact use orra, 


UPS is respon- 
0 applies to all 
forms of matching studies, including Davis’, 
which involved nonsense words but still used 


suggest only 
on one dimen- 
second of his 


or his positi 
results. Ertel and Dorst (1965) a, og 


tributed correct matching to expressive sound 
Symbolism, without discussing what “sound 
Complexes fit in sound character to [which] 
respective word-matching.” 

Now if correct matching in an unbiased 
Set of translations could be demonstrated, it 
'S quite likely that the contrasts on which 
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the subjects based their choice might be con- 
trasts of structure, length, or, in auditory 
presentation, expressive voice quality, or 
some other factors. There are probably lin- 
guistic laws analogous to Zipf’s (1935) law 
that common words (most of the studies 
for some reasons avoided uncommon words) 
tend to be short. It is not unlikely that words 
similarly used might be similar in structure 
in different languages purely from the opera- 
tion of laws concerning the economics of 
information transfer. It was this idea, ex- 
pressed differently, that led Tsuru and Fries 
(1933) to originate the word-matching tech- 
nique. They wished to demonstrate the Ge- 
stalt position that “meanings are properties 
of naturally organized wholes or processes.” 

3. Difficulty of associating specific sound 
with specific symbolic connotation: Sup- 
posing Objections 1 and 2 were satisfactorily 
taken care of, there would still be the problem 
of associating specific sounds with specific 
symbolic connotations. One of the central 
issues of UPS is to determine how certain 
sounds are appropriate to certain meanings. 
To take an example from Weiss ( 1963), the 
FAST-SLOW pair was correctly matched to 
K’UAL-MAN (Chinese) and TEz-sust (Hindi). 
Here, FAST, K’uAI, and TEZ or their antonyms 
SLOW, MAN, and susr do not share any com- 
parable phonemes, either for consonants or 
vowels at any position of the words. Some- 
times phoneme matches are even conflicting— 
/a:/ is used for rast in English, and /a/ in 
MAN for slow in Chinese. If some of the 
phonemes there actually do carry the fast or 
slow connotations in the manner of /1/ carry- 
ing small connotation, it is hard to pinpoint 
which sound is carrying which connotation. 
The difficulty is compounded if the extension 
to other languages is considered—vITE-LENT 
(French), PRESTO-LARGO (Italian), PALLEUN- 
NURIN (Korean), HAYAT-osor (Japanese), 
SCHNELL-LANGSAM (German), SKORII-MED- 
LENNI (Russian), RAPIDO-LENTO (Spanish). 

These three problems make the design of 
a valid word-matching test of UPS difficult. 
They have not been solved in the studies 
reported to date, 

4. Method of reporting: A fourth problem 
of the word-matching experiments now in the 
literature Concerns the method of reporting 
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the results. Generally, the measure reported 
is the overall percentage of correctly matched 
pairs. This measure is a compound of the 
amount of agreement between subjects and 
the number of pairs matched more often cor- 
rectly than not. For example, 75% correct 
matching could be produced if subjects al- 
ways agreed, and got 75% of the pairs always 
correct, or if each pair were correctly guessed 
by 75% of the subjects. These two extreme 
possibilities would lead to quite different 
interpretations of the data. As the percentage 
of correct guessing usually reported is 50%-— 
60%, this question assumes major importance. 
We know from many studies (see under 
Question 2, above) that English-speaking 
observers will agree fairly well on most of 
the pairs, so the correctness of their agree- 
ment is the only issue in question. Since most 
of the lists used about 20-30 word pairs, at 
least 70% of the pairs must be matched 
more often correctly than not for the pair- 
matching result to be significant at the .05 
level for a single list. No study since Tsuru 
and Fries (1933) has reported a list matched 
better than 65% correct overall, and only 3 
of 21 lists have given over 60%. Brown 
(1958) reports that in the Brown, Black and 
Horowitz (1955) study, the conceptions were 
correct twice as often as they were incorrect. 
Those authors analyzed Allport’s (1935) data 
in similar fashion and found comparable re- 
sults. They are thus claiming 66% 
Weiss (1963) with four 20-pair list: 
from Brown, Black and Horowitz’s study, ob- 
tained 62% matched more often correctly 
than not, with about 55% correctly matched 
overall. In 80 essentially independent pairs, 
he found 50 correctly matched more often 
than not. This is barely significant at the 
05 level. These results suggest that some 
level around 65% probably represents the 
probability of a pair being matched more 
often correctly than not under typical condi. 
tions. In other words, if a biased translation 
happened only once in 15 words (73 pairs) 
the deviation from chance would be entirely 
explained. g 
The results of current word-matching 
studies cannot be considered as giving evi- 
dence in favor of agreement of connotation 
between English and the various foreign 


accuracy, 
s selected 
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languages used. Brown (1958) has mentioned 
a minor study that further discredits the 
hypothesis. When he presented the word list 
used by Brown, Black, and Horowitz to 
Chinese-speaking subjects in Boston, he ob- 
tained slightly (not significantly) better than 
chance matching overall, but the Chinese sub- 
jects disagreed significantly with the English 
subjects in their direction of agreement On 
the different pairs. Brown says “the results 
as a whole do not strengthen our faith in a 
universal human [phonetic] symbolism.” 
All in all, the results of experiments on 
Question 7 using word-matching type tech- 
nique may be summarized as having involved 
much effort, perhaps in the wrong direction, 
and produced little evidence directly relevant 
to UPS. Such as it is, the evidence points m 
a negative answer, as should be anticipate 
from the negative answer to Question 3. 


Taytor’s FEEDBACK THEORY OF 
PHONETIC SYMBOLISM 


As a background to discussion of the de- 
velopment of phonetic symbolism, we sha 
postulate an extreme case. Suppose 4 a 
guage is produced without reference to <a 
ing languages, and is used as the sole i 
guage of some community. The vocabulary 
of this language is selected by randomly sa 
signing sound combinations to concepts: an- 
wish to suggest that, over time, even this = 
guage will develop phonetic symbolism shi 
that the speakers will show subjective Pse 
netic symbolism when tested with nonse ly 
syllables. By extension from this extrene 
unfavorable case, we postulate that the eties 
mechanisms are responsible for the = 
of phonetic symbolism actually found. ounds 

In our hypothetical protolanguage; ? con- 
occur at random in association Wit? some 
cepts. Sampling fluctuations ensure tha con 
sounds will actually occur more often F thé 
nection with particular concepts tha”. it en 
total vocabulary of the language: ` e pas 
ample, let us suppose that the Janes and 
20 words with the connotation of f 
that / R / is the initial letter in five Si 251 
If / R / is the initial letter of less °° jt f 

n j 
of the words of the language. the «pot 
overrepresented as the initial letter ° 
words, 


> 


, 
| 


ANOTHER LOOK AT PHONETIC SYMBOLISM 


We have ascribed the overrepresentation of 
sounds in words of some concepts solely to 
sampling fluctuations. In natural languages, 
of course, discrepancies between actual and 
chance distribution are much greater. The 
fact that many words associated with one 
concept derive from the same historical root 
ensures the overrepresentation of the sounds 
1n the root word in the daughter words associ- 
ated with the root concept. For example, 
consider in English the persistence of / v / in 
Words dealing with words: verb, verbal, 
verbose, verbiage, and so forth. The very 
existence of language families is discovered 
because of the existence of such patterns. 

If, in our hypothetical protolanguage, a 
Person uses a word with the connotation of 
Cat, he is likely to use a word starting with 
/R/. Whenever this happens, /R/ and “heat” 
tend to become associated with one another. If 
/R/ is also overrepresented in “cold” words, 
then /R/ and cold also become associated, 
There would then be no bias toward /R/ 
eeling cold or warm, but it would be associ- 
ated with temperature in general. However, 
if /R/ happened to be underrepresented as 

e initial letter of cold words, then its 
temperature-related associations would be to 
heat. The speaker of the language would feel 
/R/ to be proper in a “hot” word, and might 
feel that it was in a way wrong in a cold 
Word. He has developed subjective phonetic 


Symbolism for /R/ on the hot-cold scale of 
Meaning, 


ti 
= tho choige of words when he speaks. To 
give the impression of heat, he can use /R/ 
not directly denote 


? : 
: /R/ words will 
appropriate. Hence the effective Tdcabulary 


becomes biased so that the proportion of 
/R/ words used for heat increases and. th 
Proportion used for cold decreases, This the 
Crease in bias aids the development of sub- 
jective phonetic symbolism in other speak- 
ers, who are brought up using the changed 
anguage. 

The thesis is that if objective phonetic 
Symbolism exists in a language, then the 
SPeakers will develop subjective phonetic 
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symbolism: If speakers have subjective pho- 
netic symbolism, then the language will de- 
velop objective phonetic symbolism, The two 
patterns will be identical. 

This direct analysis of the effects of feed- 
back on phonetic symbolism suggests that 
each sound should eventually become masi- 
mally associated with one or other pole of 
each conceptual scale. This result is patently 
ridiculous, and is due to considering the con- 
ceptual scales as if they were entirely sepa- 
rate. In fact, association of a sound with 
one concept implies its association to some 
degree with almost all other concepts in 
the language. This interlocking nature of 
the language ensures that extreme polariza- 
tion seldom occurs, since polarization on 
some dimensions tends to depolarize other 
dimensions, 

The overall resulting pattern of phonetic 
symbolism is the result of two opposed tend- 
encies. Some sounds are strongly polarized 
on some conceptual scales and unpolarized on 
other scales. Some scales tend to have many 
polarized sounds; others, few. One may 
hazard a guess that cultural patterns may 
affect which scales tend to strong polariza- 
tion. Taylor and Taylor (1962) suggested 
that cultural patterns play a part in deter- 
mining which conceptual scales tend to show 
related patterns of polarization. 

As a language evolves, and dialects diverge, 
so should the pattern of phonetic symbolism 
change. Where sound shifts occur, such as the 
shift from initial /t/ to /ts/ (Z) in High 
German, the phonetic symbolic values should 
follow the shift. Where an English observer 
finds initial /t/ to have some particular con- 
notation, the German should find initial /ts/ 
to have the same connotation. The possibility 
arises that correlations of phonetic symbolism 
patterns in monolinguals might be used as a 
measure of historical relationship among lan- 
guages. The idea is like that of determining 
the similarity of a set of words for basic 
concepts (Glottochronology; Hockett, 1958), 
except that no such arbitrary listing is re- 
quired. The probability that cultural patterns 
affect the patterns of phonetic symbolism 
means that this type of measure must be 
severely tested before it is assumed to be 
reliable. 
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The analysis above has been largely in 
terms of initial sounds. The theory makes no 
distinction. Wherever associations may con- 
sistently recur between sounds and concepts, 
there phonetic symbolism will be found. The 
most prominent sounds in words are prob- 
ably the initial sounds, the first vowel, and 
the stressed vowel. These points might be 
expected to contribute most to phonetic sym- 
bolism patterns, but only because of their 
prominence. Taylor and Taylor (1962) found 
no consistent difference in the strength of 
symbolism between initial and final conso- 
nants in CVC nonsense syllables. Of the 16 
possible comparisons in 4 languages, 7 showed 
symbolism significant with p < .001 for both 
initial and final consonant, 3 showed the final 
consonant to give more significant symbolism 
than the initial consonant, and 6 showed the 
initial consonant more significant than the 
final. 

While the theory does suggest why certain 
scales of meaning show more phonetic sym- 
bolism than others, and why the strongly 
symbolic scales vary from language to lan- 
guage, it does not give a reason why speakers 
of one language are more susceptible to 
phonetic symbolism than are speakers of 
another language. Both Miron (1960 1961) 
and Taylor and Taylor (1962) found Japa- 
nese speakers giving far more strongly sym- 
bolic scores overall than did their English- 
speaking subjects. Taylor and Taylor also 
found Tamil speakers to give less variation 
in scores than English speakers give, These 
results occurred in spite of the fact that Japa- 
nese and Tamil speakers showed equal inter 
subject consistency, while English speakers 
showed more intersubject consistency than 
ia There seems to be a real effect. Dif- 

erent communities are differ Dti 
to phonetic symbolism. wai susceptibla 


CONCLUSION 


Though there have been numerou i 
I s studies 
of the UPS hypothesis, there is still little 
direct evidence on several of the . 


tions. However, the logical hierar 
the different questions of the e 
phonetic symbolism suggests that 
given in the evidence may be ac 
few qualms. Phonetic symbolism 


basic ques- 
chy among 
xistence of 
the answers 
cepted with 
occurs in the 
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words of a language, and it is detected in 
arbitrary phoneme clusters by speakers of a 
language. The symbolic connotations of a 
particular phoneme are the same in both 
cases. The symbolic connotations are not the 
same in the words of or between speakers of 
unrelated languages. 

While the question of existence appears 
settled, the question of degree remains to be 
answered. For example, the results of Miron 
(1960, 1961) and Taylor and Taylor (1962) 
suggest that Japanese speakers are more sus- 
ceptible to phonetic symbolism than are 
English speakers, and that within each grouP 
different dimensions show different strengths 
of symbolism. There is a suggestion (er 
man, 1933, Taylor and Taylor, 1962) aie 
the consistency of subjective phonetic sym 
bolism may increase slightly with age. ard 
a trend would be anticipated by the langue 
dependent theory, Parametric studies of the 


> r 

and allied questions would go far PU n 
clarifying the remaining areas of confus! 
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IS TIME-OUT FROM POSITIVE REINFORCEMENT 
AN AVERSIVE EVENT? 


A REVIEW OF THE EXPERIMENTAL EVIDENCE* 


HAROLD LEITENBERG ? 


Indiana University 


Experiments concerning the aversive properties of time-out (TO) from positive 
reinforcement are reviewed. A discussion of experimental designs employed 
and problems encountered in defining aversive stimuli precedes the review. 
The major topics covered are: avoidance of TO, escape from TO, escape 
from stimuli which previously signalled TO, punishment with TO, effects on 
ongoing behavior of pre-TO stimuli, escape from conditions of positive re- 
inforcement into TO. In general, TO satisfied sufficient criteria to conclude 
that it belongs to the class of stimuli called “aversive.” However, since some 
differences between the effects of shock and TO were found; since some 
experiments allow other interpretations than that TO is aversive; and since 
so much research remains to be done, this conclusion is only tentative. 


Withdrawing positive reinforcement and 
eliminating the opportunity to obtain positive 
reinforcement are commonly employed in our 
society as punishments for behavior judged 
to be undesirable. Sentence to prison, dis- 
missal from work, loss of driver’s license, and 
withheld allowance are a few examples. It is 
only in the last decade, however, that the 
aversive properties of the removal of positive 
reinforcement have begun to receive an 
experimental analysis of some depth. 

The present paper reviews this experi- 
mental literature, with special emphasis put 
on those studies that represent different pro- 
cedural approaches to the problem. A short 
discussion of what experimental operations 
might conceivably be subsumed by “time-out 
from positive reinforcement” precedes the 
review as does an equally brief summary of 
the usual defining criteria of an aversive 
event. 

This paper’s scope does not extend to a 
discussion of theories which suggest that 
omission of reward induces in the organism 
an emotional state labeled frustration (Brown 
& Farber, 1951, Lawson & Marx, 1958b). 


1 This manuscript was prepared while the author 
had a predoctoral fellowship (1-FI-MH-23, 283-01) 
from the National Institutes of Health. The author 
would like to thank James A. Dinsmoor for his 
helpful criticisms of an earlier draft of this paper, 

2 Now at the University of Vermont, 


5 e cues 
This hypothesized internal state and th 


associated with it are sometimes said a dis- 
specific aversive properties, as well “pions 
criminitive, and diffuse energizing fU” 
(eg., Amsel, 1958, 1962; Marx, 1956): 


. torte” 
Definition of TO from Positive Reinfo" 
ment , ybich 
There is no single set of operations 1 po i- 
adequately defines time-out (TO cae js 4 
tive reinforcement, The essential feat forc?” 
period of time in which positive T®' 
ment is no longer available. ers 


iv 

Within this boundary can fit such d pi 
experimental operations as extinction? swe 
discrimination training, and intervals rein 
reinforcement in various schedules a 3 
forcement. These three procedures iy ms 
tinguished in part by differences perio 
amount of stimulus overlap betwee ment: he 
of reinforcement and nonreinforce™ nal m 
extinction there are no cues which P s « 
absence of positive reinforcements off Pd 
absence of reinforcement is Sêt ‘p sch® 


change in stimulus conditions, 2° eriod® i 
ules of reinforcement the cues io ased giy 
nonreinforcement might be ec us of 
response produced, These proce red! enc 
differ, also, in the duration an gil” 
nonreinforced periods. eons 2 due 
Studies which have used PÍS? ckout 


jects conventionally establish 4 
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mg TO; all house lights and key lights are 
turned off. Since pigeons tend to be immobile 
m darkness, in effect the opportunity to re- 
Spond is eliminated as well as the positive 
reinforcement. This could, therefore, be con- 
ae an example of blocking or interrupt- 
& an instrumental response chain which 
ormerly led to food reinforcement. 
Another possible way to arrange a TO 
i Positive reinforcement is to block the 
ummatory response; reward can be visi- 
nH but inaccessible. A barrier can separate 
Pa an from the food, or the mouth 
Piha muzzled to prevent eating but not 
Mian Reward can also be presented and 
sibl awn so quickly that eating is impos- 
& 
ae has not been sufficient work done in 
oy area to show what different effects might 
În To en asa function of these variations 
all i sees Different effects might be 
rele! however, simply because some pro- 
teini contain more elements of secondary 
Mars os than others (e.g., Lawson & 
tistics 58a). Direct procedural com- 
need to be made within each of 


Ara paradigms used to define an aversive 
mulus. 


Sa POEN 
Cfinition of an Aversive Event 


r Discussion of what is meant 


aversive” 2 = 


Parnas en will be restricted to the 
a al sphere because the only other 
emen ies deal with subjective 

<n =o! pain or unmeasured physiologi- 

PA oe ibrium, These are obviously fruit- 

hae oe is a “hypothetical con- 
the deini nas no meaning independent of 

td > experimental operations and 

rien behavioral effects, 

te a event 1s empirically defined as aversive 

ehavior which prevents or terminates it 


is maintained above oper: 
Perant le 
| Schoenfeld, 1950). This re who (Keler st 
p nts the avoid- 


ance and escape paradigms respectively 
other empirical definition of an aaee 
_mulus is that a response which produces it 
vill decrease in frequency below its operant 
«Vel, This is what Mowrer (1960) calls 
Passive avoidance” and Kimble (1961) calls 

s pP Wnishment training.” A fourth possible de- 
ing property of an aversive stimulus is 
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exemplified by the CER (conditioned emo- 
tional response) or conditioned suppression 
phenomenon (Estes & Skinner, 1941). Here, 
ongoing behavior maintained by a schedule 
of positive reinforcement is suppressed in the 
presence of a signal which systematically pre- 
cedes and terminates coincident with the ad- 
ministration of electric shock. Finally, if a 
stimulus is a primary negative reinforcer, 
then neutral stimuli which set the occasion 
for its occurrence should themselves acquire 
secondary negative reinforcing properties. 

One obvious problem with these different 
ways of measuring aversiveness is that they 
may not agree with each other. The expla- 
nation might simply be that they are not 
equally sensitive. Or, they may not be 
equivalent measures. It is an empirical ques- 
tion whether a stimulus meeting one criterion 
would meet any other. There doesn’t seem 
to be any explicit basis, either, for con- 
sidering one criterion “better” than another 
though a bias exists in the literature in 
favor of avoidance and escape learning. 

Another difficulty is introduced by the fact 
that an event such as sudden stimulus change 
will satisfy for a short period of time the 
criterion measures of punishment training 
and conditioned suppression. Does this make 
stimulus change an aversive event? Or must 
duration of effect also be considered in the 
definition of a negative reinforcer? To fur- 
ther complicate matters, under certain test 
and training conditions stimulus change will 
act as a positive reinforcer (e.g., McCall, 
1965). 

Another ambiguity in the definition of 
aversive stimuli is presented by the case of 
satiation. It seems to satisfy the punishment 
criterion of an aversive event; if excessive 
amounts of reward are received, responding 
will be depressed. Note that repeated ex- 
posure to reward is response contingent, just 
as shock is response contingent in punish- 
ment training. Yet no one considers satiation 
to be aversive—unless its consequence is 
indigestion. This suggests that a complete 
definition of punishment requires further 
specification which would exclude the case 
of satiation. 

It seems most reasonable, then, when trying 
to determine the aversive properties of a 
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given event or stimulus, to use as many of 
these different defining techniques as possible. 
The more the measures agree with each other, 
the more certain it is that the event is aver- 
sive. This view is predicated on the relatively 
close agreement between measures when elec- 
tric shock is the aversive stimulus in ques- 
tion, and it governs the manner in which the 
rest of this paper is organized. 

Many of the studies reviewed in this paper 
were carried out in the operant tradition of 
repeated measurements on a small sample of 
subjects. The reliability of the data for the 
individual Ss is, therefore, of a superior order 
than usual group data, but the generality of 
the reported finding is sometimes left in 
doubt. 


Avoidance of TO from Positive Reinforce- 
ment 


Experiment IV in Ferster’s stimulating 
1958 monograph illustrates one procedure for 
testing whether or not Ss will learn to avoid 
TOs. Chimpanzees were employed as Ss, 
Two levers located 12 in. apart were present 
simultaneously in the experimental enclosure. 
Responses on the right-hand lever were 
positively reinforced on a variable interval 
3 min. schedule. A continuous avoidance 
schedule was programed on the left-hand 
lever. If no avoidance response occurred, a 
TO of 3 min. was scheduled every 45 sec 
Each response on the left-hand lever how- 
ever, postponed the TO from the VI schedule 
for a certain duration. These durations 
which Sidman calls response-stimulus (R-S) 
intervals in continuous shock avoidance (Sid- 
man, 1953), were systematically varied: they 
were 1, 2, 5, and 10 min. Ferster found an 
increase in rate of avoidance respondin 
the interval was decreased, in 
f Morse and Herrnstein (1956) reported a 
similar study, using pigeons as Ss. Again two 
response alternatives were present concur- 
rently. One response was maintained by 
variable interval schedule of a 
forcement, and the other res 
TOs from this schedule. As with the chimps 
response rate on the avoidance key isticaast 
as the R-S interval was decreased 

Thomas (1964) showed that when th 
mean interval between reinforcements ba 


Positive rein- 
ponse postponed 
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changed from 9 min. to 1 min., the number 
of TOs that were avoided decreased. Appar- 
ently when the frequency of reinforcement 1$ 
higher, pigeons are more “willing” to tolerate 
TOs from the schedule of reinforcement. This 
is unexpected, because at short mean 

intervals, Ss lose more reinforcements pet 

than at long mean VI intervals. Avoidance 
of TO is possibly governed in part by some 
complex relationship between relative 1056; 
absolute loss, and absolute frequency ° 
reinforcement. This remains to be nore 


out. dy 


Mechner and Ray (1959) demonstrate” 
that rats will learn to avoid TOs from * 
fixed-interval 15-sec, schedule of reinforce 6 
ment. In order to avoid a 2-min. inter or 
of this schedule, Ss had to refrain P 
pausing longer than 3 sec. between succes? 
responses on the food reinforced bar- 
this contingency was removed, interres 
times became longer. 

Baer (1960) investigated whethe! 
garten-age children will learn to avo! 
film cartoons cut off, He found an “es 
avoidance schedule” to be quite effect st- 
this procedure each avoidance response it 
poned the next film interruption in @ c RS 
lative manner, For instance, i i nse 
interval was 5 sec, and S made 20 resp ond 
in rapid succession and then stopped Fr 


pons? 
r kinder 


d havin’ 
calato® 


until 


When Baer used the usual Sidman = i 
Procedure—where each response eevee! a 
the timer controlling the delay be" y“ i 


sponse and TO—he failed to maint his "Is 
ance behavior with this age grouP: inter 
ure might have been caused by be 
that were too short—3, 5, and 
because the Baer experiment, “TO 
others, did not study avoidance Q 
a schedule of response continge” squire? gs 
reinforcers, Initially, Ss were not d carton, 
make a response to produce the pra ave 
The prevalent explanation © sr pio? jp 
ance by reinforcement theorist 953) í 
1954; Schoenfeld, 1950; Sidma™ hce of 
terms of the immediate consed suction a? 
avoidance response, namely, f n 7 
conditioned aversive stimuli. “gpa” 
avoidance, all behavior ot® 


IS TIME-OUT FROM POSITIVE REINFORCEMENT AN AVERSIVE EVENT? 


avoidance response is sooner or later closely 
followed by shock, and therefore, tactile, 
proprioceptive, and environmental stimuli 
Produced by these nonavoidance responses 
acquire aversive properties. The avoidance 
response is supposedly reinforced because 
it terminates these stimuli. This formulation 
could explain the data which illustrates 
avoidance of TO from positive reinforcement. 
It is only necessary to assume that response 
Produced stimuli which precede the loss of 
Positive reinforcement acquire secondary 
negative reinforcing properties. A later section 
this this paper reviews studies related to this 
assumption, 

But another interpretation is possible—one 
that suggests that none of these TO avoid- 
ance studies provide clear evidence that TO 
from positive reinforcement is an aversive 
event. It could be said that these studies 
Only indicate that Ss will maintain behavior 
which leads to positive reinforcement. The 
Concept of aversiveness seems unnecessary. 
Rather it Seems adequate and more parsim- 
Monious to explain these findings just by say- 
ing: The pattern of behavior followed by 
Most positive reinforcement is most strength- 
ened. For in all the studies so far reviewed, 
S gains more positive reinforcement during a 
Session if he makes the avoidance response 
than if he fails to make it, 

Note, though, that in the studies cited, the 
avoidance response is often made on a dif- 
ferent manipulandum from the one which 
leads to food reinforcement. Also some form 
of “change over delay” is usually arranged 
so that the avoidance response can never 


immediately precede the delivery of food and 
thereby receive adventiti 


sec. of an avoidance response. 

Nonetheless, over a given period of time 
avoidance behavior in conjunction with be- 
havior on the food key will lead to the 
greatest amount of positive reinforcement 
No other behavior pattern will do the same. 
And it is commonplace to note that behavior 
Which successfully leads to the most Positive 
reinforcement is maintained while other be- 
Navior is eliminated. If Ss are given a choice 
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between the right arm of a T maze in which 
reward is present on every trial, and the left 
arm of a T maze in which reward is present 
on only half of the trials, it is likely they 
would show a preference for the right arm. 
This does not make the left arm aversive; 
it just makes it less preferred. Few theorists 
would label choice of the right arm an avoid- 
ance response. Similarly, responding on both 
the avoidance bar and the food bar is the 
preferred response pattern because it leads 
to more reinforcement in a shorter time 
period. 

This interpretation in no way negates the 
Possibility that avoidance responding in these 
studies is maintained primarily because it 
postpones an aversive event—TO from posi- 
tive reforcement. It is just unfortunate that 
the studies are not designed in a way which 
would make it possible to untangle the two 
interpretations, 

What would be more convincing is a 
demonstration that Ss will learn to avoid 
stimuli which previously set the occasion for 
TOs from positive reinforcement. Unlike 
studies reporting avoidance of TO itself, the 
avoidance response here would no longer have 
the added effect of increasing the amount of 
positive reinforcement S gets during a session, 
compared to what he would get if the avoid- 
ance habit had not been established. 


Punishment Training with TO from Positive 
Reinforcement 


In the punishment procedure, the aversive 
stimulus is response contingent, and atten- 
tion is focused on the subsequent decline in 
the frequency of the punished response. The 
responses that take the place of the pun- 
ished response—avoidance responses—are not 
usually examined, 

Ferster has shown in a series of studies 
(Ferster, 1958, Exp. III; Ferster, 1960; 
Ferster, Appel, & Hiss, 1962) that chimps 
and pigeons will stop responding in the pres- 
ence of a signal if responding leads to a TO, 
and if cessation of responding serves to avoid 
TO. A signal, such as a change in the color 
of the lighted disc at which the pigeon is 
pecking for food reinforcement, precedes a 
scheduled TO by a fixed duration. If the S 
stops pecking at the disc in the last 5 sec. 
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of the signal’s presence, then the signal will 
terminate without being followed by a TO. 
If S continues to respond, a TO follows after 
the signal terminates. With pigeons, respond- 
ing is almost completely suppressed during 
the entire time the signal is present. Chimps 
make a finer discrimination, continuing to 
respond until the critical period of the signal 
is reached. 

This procedure should not be confused, 
as it sometimes seems to be, with the 
more familiar Estes-Skinner CER paradigm 
(1941). In the CER procedure, response sup- 
pression does not serve to avoid shock; 
changes in behavior in the presence of the 
CS have no programed consequence upon 
the delivery of shock. A later section of this 
paper reviews those studies which compare 
shock and TO in the CER paradigm. 

Baer (1962) was able to suppress thumb- 
sucking in nursery-school age children by in- 
terrupting film cartoons whenever the child 
had his thumb in the mouth. When the pun- 
ishment contingency was withdrawn, thumb- 
sucking reverted to its earlier frequency. 

These punishment studies are as difficult to 
interpret as the avoidance studies were, The 
response contingent TOs reduce frequency of 
reinforcement per unit time. The higher posi- 
tive reinforcement frequency maintained by 
the pattern of behavior that evolves (re- 
sponse suppression) seems sufficient explana- 
tion of the results. There is no need to 
postulate that the TOs were aversive in these 
studies. 

It is possible to schedule contingent TOs 
(punishment with TO from positive rein- 
forcement) in such a way that response sup- 
pression (should it occur) would not affect 
overall frequency of reinforcement, 

Using mental hospital patients as Ss, Holz 
Azrin, and Ayllon (1963) demonstrated that 
TO from Positive reinforcement has relatively 
weak Suppressive powers when it js made 
contingent upon an operant response main- 
tained at the same time by some positive 
reinforcement, in this case cigarettes, Every 
10th response produced a TO of 30 sec 
Cigarettes were delivered for the same re- 
sponse on a variable-interval schedule. In this 
procedure, response Suppression, as long as it 
was not complete, neither increased nor de- 
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creased frequency of reinforcement. Two out 
of the four Ss showed a moderate decline in 
response frequency when the TO contingency 
was introduced. The other two Ss showed no 
change in performance until a new response 
alternative was offered, one which continued 
to produce cigarettes on the same VI schedule 
but did not produce any TOs. Under 
these conditions all four Ss stopped respond- 
ing completely on the manipulandum leading 
to TOs and switched over to the new unpun- 
ished response. 

The results of this study suggest that TO 
is, at best, a very poor punisher: shock 
would suppress responding before the second 
response alternative was introduced. How- 
ever, it is worth noting that in the same 
paradigm loud noise had an effect more 
similar to TO than to shock (Herman & 
Azrin, 1964). The results of the Holz et al. 
study also support the hypothesis that TO 
acts like an aversive stimulus only when the 
behavior which satisfies the aversive criterion 
maintains a higher frequency of reinforce- 
ment than would be the case if it did not 
occur, 

Matching to sample has been a popular 
technique for studying how TO affects be- 
havor which produces it, Much of the wor 
by Ferster and his associates has been done 
with pigeons, Birds must peck at a center 
key to produce a sample stimulus, usually 
a particular key color, After some fixe 
period, the light behind the center key ' 
turned off and the two side keys are lit YP: 
Tf the bird pecks the side key whose apes 
Corresponds to the former color of the cente 
key, he is reinforced on some interval S€ Ti 
ule. Tf he pecks the incorrect key, 2 -oii 
produced. A noncorrection procedure 1$ e i 
ployed so that once § makes his choice ™ y 
sponse on one of the side keys, both side i 
lights go off. Tf an incorrect response is Ma" 
the center key is relighted after the a 
over. Every correct response is reinfor" n 
with a brief flash of the magazine ligh 
While food is delivered on some schedule- , 


The major variables which have bee? jie 
vestigated in the matching-to-sample g 

are TO duration, the schedule by whieh, ce" 
is programed, and the schedule of re! The 


. Si 
ment for correct matching response 
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first finding was that when every correct re- 
Sponse was positively reinforced rather than 
Intermittently reinforced, TOs following in- 
Correct responses had little effect on match- 
mg accuracy (Ferster & Appel, 1961). As 
Ferster and Appel pointed out, interval sched- 
ules are generally more sensitive to independ- 
ent variables than continuous reinforcement 
or ratio schedules. These authors found that 
when correct matching responses were rein- 
forced on a VI 3-min. schedule, matching 
accuracy progressively improved as the TO 
duration for incorrect responses was raised 
from 1 to 10 to 30 to 60 sec. A TO duration 
of 2 min. disrupted matching performance 
by depressing both correct and incorrect 
responses indiscriminately. 

Zimmerman and Ferster (1963) reported 
a similar finding with different TO durations. 
A 10-min. TO following incorrect responses 
Impaired matching accuracy, while a middle 
range of TO durations improved matching 
accuracy, This study also showed that match- 
Ing accuracy declined as the number of 
Incorrect responses required to produce a TO 
Was increased. 

These matching-to-sample studies provide 
More substantial evidence than the other 
Punishment studies that TO acts like an 
aversive stimulus. It is the generalized sup- 
Pression of both incorrect and correct re- 
sponses by long TOs that is important, This 
Suppression does not have the added effect 
of increasing reinforcement frequency per 
unit time. In fact, response supression here 
leads to fewer reinforcements than if it did 
not occur, 

Holder, Marx, Holder, and Collier (1957) 
published an interesting experiment which 
could be interpreted as illustrating the pun- 
ishing effects of interrupting an instrumental 
response chain ending in positive reinforce- 
ment. Before they were allowed to complete 
their run to a goal box containing food, rats 
were delayed for different periods of time at 
a halfway point in a straight alley. The group 
that was delayed the longest took the longest 
time to run to the detention box. Possibly 
the running response was disrupted because 
Ìt lead to the section of the runway associated 
With delay, a section which acquired aversive 
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properties and provoked an approach-avoid- 
ance conflict. 

Though the required length of a complete 
treatment is prohibitive for this paper, it 
seems desirable to at least briefly discuss the 
relevance of experimental extinction. The 
experimental operations of extinction of a 
positively reinforced response satisfy the 
punishment paradigm; the consequence of 
each formerly reinforced response is the 
omission of reward, and this paper assumes 
that omission of reward is a substantive 
stimulus. The question is whether or 
not it acts like other aversive stimuli in the 
punishment paradigm. In general, it does. 
The frequency of a formerly reinforced re- 
sponse declines when reward is withdrawn, 
just as it does when shock is introduced. 

Although Amsel has not used the term 
punishment, his analysis of the inhibitory 
effects of nonreinforcement is in essence a 
punishment analysis. He suggests that ab- 
sence of reward produces a motivational state 
of frustration which is chained back from the 
goal box and weakens the instrumental ap- 
proach response. The approach response is 
weakened because it produces conditioned 
anticipatory frustration stimuli which are 
aversive. Consequently, responses which com- 
pete with the approach response are strength- 
ened because they reduce this conditioned 
aversive frustration. This is the familiar 
avoidance explanation of punishment (Dins- 
moor, 1954, 1955). In fact, Amsel says, “the 
greater the strength of frustration the faster 
the extinction (avoidance) of a nonrewarded 
response [1962, p. 308].” No doubt 
Dinsmoor would discard the concept of 
frustration as unnecessary, just as he dis- 
cards the concept of fear when shock is the 
punishing stimulus. A more behaviorally 
oriented statement would be: because omis- 
sion of reward is aversive the stimuli pro- 
duced by the formerly reinforced response 
acquire aversive properties; therefore, other 
responses which terminate these conditioned 
aversive stimuli are strengthened at the 
expense of the formerly reinforced response. 

Whether or not the avoidance hypothesis 
originally offered in explanation of the facts 
of punishment can account for the various 
phenomena of extinction is still unknown. It 
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is certain, however, that the adequacy of such 
a theoretical account will depend a great deal 
on a finer experimental analysis of the simi- 
larities and differences between extinction 
(punishment with TO) and punishment 
(punishment with shock or other supposed 
aversive stimuli) than currently exists. For 
example, it would be interesting to know if 
spontaneous recovery and disinhibition occur 
with punishment as well as with extinction. 
To the author’s knowledge, no such compari- 
sons have ever been made. 

Other comparisons have been made, how- 
ever, and the evidence suggests that there are 
considerable similarities between the effects 
of extinction and punishment. A recent para- 
doxical finding that has been repeated quite 
often is that with a greater number of train- 
ing trials there is less, not more, resistance to 
extinction (see Paul, 1965, for a related re- 
view). Similarly, it has been reported that 
as the number of training trials increases, the 
suppressive effects of punishment increase 
(Karsh, 1962; Miller, 1960). 

Karsh (1964b), using a discrete trials 
operant procedure, studied the effects of 
intertrial intervals on the suppressive action 
of punishment. During the punishment phase 
of the experiment, Ss undergoing 1 trial 
a day showed more resistance to the suppres- 
sive effects of punishment than Ss receiving 
10 or 50 trials a day. Although the results 
with extinction are not always consistent, the 
majority finding is that with spaced trials 
during extinction there is also greater re- 
sistance to extinction (Kimble, 1961, p. 283). 

Reynolds observed “behavioral contrast” 
in pigeons with both response contingent 
shock and nonreinforcement. When respond- 
ing in the presence of one stimulus was pun- 
ished and suppressed with shock, rate of 
responding maintained by a VI schedule of 
reinforcement in the presence of another 
stimulus increased (Brethower & Reynolds, 
1962). Similarly, when reinforcement for re- 
sponding in the presence of one stimulus was 
withdrawn, rate of responding increased in 
the presence of another stimulus (Reynolds, 
1961). 

Brown and Wagner (1964), in a fascinat- 

ing study, found that gradual exposure to 
shock during training increased resistance to 
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extinction on later test trials and increased 
resistance to the suppressive effects of more 
intense shock. The converse was also true: 
occasional nonreinforced trials during train- 
ing later increased both resistance to the sup- 
pressive effects of shock and resistance to the 
suppressive effects of extinction, This is a 
particularly clear demonstration of the simi- 
larity of shock and loss of reinforcement. 

Likewise, Karsh (1964b) found that add- 
ing punishment to food reinforcement during 
a part of acquisition increased resistance tO 
extinction. This finding is reminiscent of the 
partial reinforcement effect: interspersed non- 
reinforced trials during acquisition, as com- 
pared to continuous reinforcement during ac- 
quisition, increases resistance to extinction. It 
would be useful to make a parametric in- 
vestigation of the effects of different percent- 
ages of punishment trials during acquisition 
on resistance to extinction. 

Some drugs produce similar effects on Te 
sponding suppressed by shock and respond- 
ing suppressed by experimental extinction. 
Sodium amytal and alcohol lessen the degree 
of Tesponse suppression during approach- — ; 
avoidance conflict periods (Barry & Miller, 
1962). These two drugs also lessen the de- 
gree of response suppression caused by the 
withdrawal of reinforcement during extinct- 
tion (Barry, Wagner, & Miller, 1962). 

Fewer experiments demonstrating differ- 
ences between extinction and punishment 
were found. Holz and Azrin (1963) suggest? 
that extinction suppresses behavior we 
quickly and less completely than moderate 
levels of shock, and that when conditions ° 
extinction and punishment are withdraw? 
and reinforcement is reintroduced, TesP° 
ing recovers more rapidly after extinction 
than after punishment, As Holz and ee 
Suggested, these differences may simply 
due to differences in degree of aversivenee 
between response-contingent extinction a 
response-contingent shock, Other differen” 
are (a) A greater magnitude of eve 
during training produces less resistance? al 
extinction (Amsel, 1962, cites S°Y® js 
studies); while the converse relationsh'P ce 
found between size of reward and ng This 
to punishment (Bower & Miller, 1960): © js 
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probably a relationship between the size of 
the reward that is withdrawn and the degree 
of aversiveness of extinction, while no such 
relationship exists between the size of reward 
and degree of aversiveness of shock. Further 
tests of this hypothesis need to be made in 
aversive paradigms other than punishment. 
(b) The total number of responses emitted 
during extinction was unchanged even though 
an initial period of mild punishment tempo- 
rarily suppressed responding (Estes, 1944), 
Suggesting that the two procedures control 
response frequency via different processes. 


Escape from TO and Escape from Stimuli 
Associated with TO 


Any study which shows operant behavior 
being maintained by a schedule of positive 
reinforcement can be interpreted as demon- 
Strating escape from periods of nonreinforce- 
ment. Responding in the intervals between re- 
inforcements can be said to serve two func- 
tions at the same time—terminating a period 
of nonreinforcement and leading to the 
receipt of positive reinforcement. This is, 
obviously, not very satisfactory evidence that 
TO from positive reinforcement has negative 
reinforcing properties, 

A study in which the escape response did 
not simultaneously lead to positive reinforce- 
ment and to termination of the supposed 
aversive stimulus was done by Adelman and 
Maatsch (1956). Three groups of 10 rats 
each were trained to jump 10 in. to a perch 
Surrounding the top of a goal box. The goal 
box was at the end of a straight alley. The 
ewer for jumping out of the goal box varied 
“a ae groups. For one group the goal 
st iO iad contained food reward 

’ & up the alley. Two other groups 
without this experience received equal 
amounts of direct placements into an empty 
goal box before the jump-out phase of the 
experiment was introduced. The first group’s 
supposed reward for jumping out of the goal 
box was escape from an aversive frustrating 
situation, that is, an empty goal box in place 
of one with food. The second group received 
a pellet every time they jumped out of the 
box and onto the surrounding perch. The 
only possible reinforcement for the third 


8toup was escape from confinement and/or 
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satisfaction of some exploratory drive. The 
results clearly demonstrated that when the 
box was associated with the removal of food, 
Ss jumped fastest and continued to per- 
form at the same level for over 100 trials. 

This is a somewhat surprising finding. 
Ordinarily, one would expect a stimulus asso- 
ciated many times with reward to become 
a secondary reinforcer and if it is never again 
paired with the primary reinforcer to lose its 
effectiveness as a secondary reinforcer gradu- 
ally. Then it would revert back to a neutral 
stimulus or, perhaps, as this study suggests, 
become an aversive stimulus. It is not sur- 
prising that the stimulus becomes aversive, 
but it is surprising that the stimulus changes 
its function so rapidly. Although Ss con- 
tinue to run to the empty box, once they 
get there and find no reward they escape. 
Perhaps being kept in the empty goal box for 
5 min. before being allowed to escape is a 
critical factor. 

More recently, Wagner (1963) has shown 
that rats will learn to jump over a hurdle 
to terminate stimuli associated with the ab- 
sence of reward. Two groups of rats equated 
for weight were trained to run down an alley 
on a 50% partial reinforcement schedule. 
For the experimental group, an auditory or 
visual CS was presented in the alley and in 
the goal box on nonreinforced trials. For the 
control group, the CS was presented in the 
retaining cage and did not signal nonrein- 
forcement. In test trials, experimental Ss 
jumped over a hurdle faster than control Ss 
when this response served to terminate the 
CS. As training progressed, escape latencies 
decreased for the experimental group while 
they increased sightly for the control group. 

These two studies are crucial for two 
reasons. First, the best evidence so far that 
TO from positive reinforcement is aversive 
comes from studies done in the punishment 
paradigm, but convention has it that escape 
and avoidance are more important criteria 
of aversive stimuli. As suggested earlier, 
previous experiments trying to demonstrate 
avoidance and escape from loss of reinforce- 
ment had a flaw. It was not clear whether 
the avoidance behavior—which was success- 
fully demonstrated—was maintained because 


an aversive event was thereby postponed or 
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because a greater frequency of reinforcement 
was thereby assured. Escape behavior in the 
Adelman and Maatsch experiment and in the 
Wagner experiment, on the other hand, does 
not appear to have any other reinforcement 
than the removal of what must be consid- 
ered conditioned aversive stimuli. To insure 
the reliability of this phenomenon, however, 
it would be useful to have a few more 
examples. 

Secondly, these two studies attest to the 
validity of the major assumption of the 
usual explanation of avoidance behavior. For 
both shock avoidance and TO avoidance, the 
assumption is that neutral stimuli paired with 
the stimulus to be avoided will acquire 
secondary negative reinforcing power. 

This is the last evidence that the author 
will review which strongly supports the notion 
that TO is an aversive stimulus. The remain- 
ing experiments could be considered negative 
evidence; however, “qualifying evidence” 
seems the more appropriate term. 


Effects of Pre-TO Stimuli 


Ferster (1958, Exp. I) has published some 
data concerning the effects on ongoing be- 
havior of a signal which precedes a TO 
from a schedule of reinforcement. He em- 
ployed a variant of the Estes and Skinner 
(1941) CER paradigm which uses shock as 
the aversive stimulus. 

In the typical CER procedure, stable refer- 
ence behavior is maintained by a variable- 
interval schedule of reinforcement. Periodi- 
cally during the session, a CS is presented for 
a fixed duration, and coincident with its 
termination a shock is delivered. Since the 
reinforcement schedule continues in effect 
while the CS is present, reward is potentially 
obtainable in the presence of the CS. None- 
theless, the repeated finding has been that 
ongoing behavior maintained by a schedule 
of positive reinforcement is suppressed in the 
presence of preshock stimuli (see Annau & 
Kamin, 1961; Brady & Hunt, 1955; Hoff- 
man & Fleshler, 1961; Hunt & Brady, 1955: 
Lyon, 1963; Stein, Sidman, & Brady, 1958). 
The CS tends to elicit conditioned fear Te- 
sponses such as freezing and defecation which 
supposedly interfere with the ongoing rein- 
forced behavior (e.g., Hunt & Brady, 1955). 


In Ferster’s study, the first response after 
the CS had been on for 30 sec. produced the 
TO. (Note that this procedure is decidedly 
different from the response-independent CER 
paradigm. In fact, it could be considered 
more representative of the punishment pro- 
cedure than of anything else.) In any case, 
Ferster found that for chimps, response rates 
accelerated above control levels in the pres- 
ence of the pre-TO stimulus. 

Herrnstein (1955), in an unpublished 
thesis, employed the same response con- 
tingent procedure and reported similar find- 
ings with pigeons, but only when the VI 
schedule of reinforcement interrupted by the 
TO had relatively long mean intervals be- 
tween reinforcements, such as 7 and 9 min. 
Why he failed to find the conditioned ac- 
celeration when the VI schedules had shorter 
mean intervals is still unknown. More work 
is needed to clarify the relationship of fre- 
quency of reward, nature of schedule, Te- 
sponse rate, and the effects of the pre-T 
CS. Herrnstein also found that as TO ure 
tion was increased, conditioned acceleration 
in the presence of the pre-TO stimulus was 
increased. The range of TO durations investi- 
gated was from 30 sec. to 50 min., and 
schedule of reinforcement remained the 4° 
VI 7 min. for each duration. 

An unpublished thesis by Leitenbers 
(1965) employed a multiple-schedule design 
to compare in the same pigeon the effects 
on ongoing behavior of preshock stimuli ae 
pre-TO stimuli. Instead of the usual black°V’ 
only the key light was turned off during k 
Different colored key lights signalled sho 

ye D np foul 
and TO. The comparison was made with 2 
birds, using both the usual CER fep 
independent procedure and the response : 
tingent procedure which both Herrnstein E 
Ferster used in their TO studies. It made te 
difference which procedure was in effect: nt 
of pecking at a key on a VI 2.25-min. $7 
ule accelerated in the presence of the pe the 
stimulus and declined in the presence hock 
AE stimulus. With very weak sightly 
. sities, response rate accelerate? > .” ac- 
in the preshock stimulus; however’, ae a of 
celeration was apparently only a FUNCH os 
generalization because it was stil p red 


ea! 
when there was no shock, and it disapP 


in 
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when the TOs were removed. Clearly the 
effects On ongoing behavior of stimuli which 
signal TO are different from the effects of 
Stimuli which signal shock—even when ex- 
tremely low shock intensities are employed. 
The difference is not simply in degree of 
effect but in the direction of effect; pre-TO 
Stimuli accelerate ongoing behavior while 
Preshock stimuli suppress ongoing behavior. 

What is the basis for this difference? Both 
the CS that precedes shock and the CS that 
Precedes TO acquire negative reinforcing 
Properties. Perhaps the preshock CS acquires 
Steater aversive properties than the pre-TO 
CS. With relatively intense shock, when the 
key light changed to the color signaling shock, 
Pigeons were observed to turn their backs 
to the key. This kind of escape response 
Could readily account for the suppression of 
key-pecking behavior in the presence of the 
Preshock CS, (Unlike rats, pigeons were not 
Observed to freeze or defecate upon onset of 
the preshock CS, so these emotional responses 
could hardly account for the disruption in 
Ongoing behavior.) The pigeons did not turn 
from the key in the presence of the pre-TO 
CS because it was not aversive enough to 
Counteract the positively reinforced key-peck- 
ing behavior. A more formal test of this 
hypothesis is desirable, Ss will make a 
response to terminate a CS that precedes 
shock even if the shock is unavoidable 
(Kamin, 1956). Will they do the same with 
TO? 

But this analysis does not explain why key- 
es frequency accelerated in the presence 
of the pre-TO CS; it just explains why it 
did not decelerate. Th iti 
sponse to omission or e unconditioned re- 
antes we Att ed reward is heightened 
god eeno a emotionality, Amsel 

; ave shown in a host of 
studies (Amsel, 1958, 1962) that i 
speed following nonreinforcement E eee 
than running speed following reiiiorcem ane 
and faster than running speed following de- 
tention in a box where reinforcement was 
never before received. In the early Periods 
of extinction of a positively reinforced re- 
Sponse, bursts of rapid responding have been 
noted (Thompson, 1961, 1962). It has also 
been found that rats press the bar with 
Breater force during extinction than during 
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acquisition (Notterman, 1959). Wagner 
(1963) found that a startle response to a 
loud 1,000-cps tone was more vigorous when 
the tone was immediately preceded by a 
CS previously associated with nonreinforced 
trials in a partial reinforcement experiment. 

Although this is pure speculation, it is 
possible that the energizing effects produced 
by TO were conditioned to the pre-TO CS. 
This conditioned “excitement” acts upon the 
most strongly established habit, key-pecking 
behavior, to cause an increase in rate of 
response. 

The fact that a preshock CS suppresses 
ongoing behavior while a pre-TO CS acceler- 
ates ongoing behavior does not mean that 
TO is not an aversive stimulus, All it might 
signify is that TO does not act like shock 
in this particular paradigm—a paradigm 
which perhaps should not be used to define 
an aversive stimulus. 

Interestingly enough, when loud noise was 
used, neither conditioned suppression as 
with shock nor conditioned acceleration as 
with TO was observed (Leitenberg, 1965). 
Whether or not other aversive stimuli, such 
as bright light, extreme temperatures, painful 
blows, or pinches will lead to conditioned 
suppression, like shock; or conditioned accel- 
eration, like TO; or have very little effect, 
like loud noise; is still unknown. 


Escape into TO from Positive Reinforce- 
ment 


Recently, a series of studies demonstrated 
that rats and pigeons will impose upon them- 
selves interruptions from fixed-ratio sched- 
ules of positive reinforcement (Appel, 1963; 
Azrin, 1961; Thompson, 1964; Zimmerman 
& Ferster, 1964). 

Essentially the same procedure has been 
used in all these studies. Two response al- 
ternatives are available to S. Responses 
on one key are reinforced on a fixed-ratio 
schedule of reinforcement. The first response 
on the other key produces a TO. The second 
response on this switching key reinstates the 
fixed-ratio schedule on the food key. In this 
way the duration as well as the initiation of 
the TO is under the organism’s own control. 
Zimmerman and Ferster (1964) presented 
data which indicates that if the TO duration 
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is fixed at 10 sec. rather than being under 
the S’s own control, fewer TO-producing 
responses will be made. 

Azrin’s (1961) and Appel’s (1963) studies 
with pigeons show that as the number of 
responses required for reinforcement is raised, 
more TOs are produced, and they are kept 
in force for a longer time. The range of 
values in Azrin’s study was from FR 60 to 
FR 200, and the range in Appel’s study was 
FR 80 to FR 240. At FR 200 Azrin noted 
that almost 50% of the session was spent 
in TO conditions, and at FR 240 Appel found 
that 30% of the session was spent in TO 
conditions. 

Thompson (1964) showed that rats will 
press a bar three times in succession in order 
to produce 30-sec. TOs from fixed-ratio 
schedules of reinforcement. More TOs were 
produced as the fixed-ratio requirement for 
water reinforcement was raised, and fewer 
TOs were produced when the ratio require- 
ment was systematically lowered. 

Appel (1963) has also shown that when 
responses on the switching key are no longer 
effective in producing TOs, responding on 
this key undergoes the typical decline of 
extinction. 

A fixed-ratio schedule of reinforcement 
produces a characteristic pause in responding 
just after each food delivery. Following this 
postreinforcement pause, Ss usually rapidly 
run off the required number of responses 
without any systematic pause occurring 
again until after the next reinforcement 
is received. Appel (1963), Azrin (1961), and 
Thompson (1964) observed that tiöst TO 
producing responses occurred during these 
postreinforcement pauses between ratio runs 

Zimmerman and Ferster (1964) found that 
the more different the stimulus conditions are 
during the TO from what they are during the 
time the schedule is in effect, the more TOs 
the pigeon will impose upon himself. The 
direction of the stimulus change associated 
with the onset of the TO does not, however 
seem important. Thompson (1964) with rats 
and Azrin (1961) with pigeons found no 
difference between TOs introducing a de- 
crease in illumination in the box and TOs 
introducing an increase in illumination 

These studies do not preclude the possibil- 
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ity that the change in stimulus conditions 
per se is the major reinforcement for TO 
producing responses (cf. Appel, 1963). The 
prevalent view, however, is that the response 
leading to TO is an escape response; it termi- 
nates stimulus conditions associated with a 
schedule which is supposed to have aversive 
properties—either because of the amount of 
work required for positive reinforcement Or 
because of the long time intervals which 
elapse between successive reinforcements. 
What leads one to believe that escape is in- 
volved here is the increasing tendency tO 
produce a change in stimulus conditions 45 
the fixed-ratio requirement is raised. 

It would be interesting to know if 55 
would escape from other difficult schedules 
of reinforcement such as a DRL schedule 
which requires Ss to pause for certain 
lengths of time between successive responses, 
and a fixed-interval schedule with very long 
intervals between successive reinforcements- 
A yoked comparison between a variable- 
interval schedule and a high fixed-rate 
schedule would be especially useful. A yoke 
procedure could equate the number 
reinforcements and the intervals betwee” 
reinforcements, but a large number of 1 


jor 
sponses would not be required or made oe 
to each reinfor = 
cement by Ss run 
y actors 


the interval schedule. In this way the f 
responsible for the escape behavior—nu™ à 
of responses, or interval between reinforce 
ment, or both—can be determined. 
Allowing Ss to interrupt the SC?° js 
and impose rest periods upon themselver p- 
an excellent way to study the aversive 
erties of fatigue in animals, The FR rea pe 
ment could be kept constant, WPIC par 
amount of force necessary to press the ted 
could be varied systematically. It is expe as 
that more escape responses would be mato 
the force requirement was raised on th? 5 
contingent bar, ; T 
Experiments demonstrating escaP® ines 
from a schedule of positive rein or? eit 
indicate that: (a) some schedules oF aod 
forcement can contain both positiv de" 
negative reinforcement properties; ‘ing 1” 
pending on the nature of the unde” 
inforcement schedule, removal of the gow i 
tunity to acquire reinforcement must 
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considered potentially either a positive or 
negative reinforcer for nonsatiated organisms. 


Concluding Remarks and Summary 


The title of this paper asks what appears 
to be a simple question: is TO from positive 
reinforcement an aversive event? Upon ex- 
amination, however, the question becomes 
Somewhat complex. There is no single TO 
Procedure, and there is no single experimental 
Procedure for defining an aversive stimulus. 
Thus, there can hardly be a simple yes or 
no answer. 

The use of TO in five different paradigms 
Was analyzed: avoidance, escape, punish- 
ment, conditioned or secondary negative re- 
inforcement, and CER. Expectations about 
What effects TOs would have in these 
Paradigms (if indeed, TO were aversive) 
Were mainly determined by the effects that 
electric shocks have had in previous studies. 
Although there have been some experiments 
employing other supposed aversive stimuli, 
Such as loud noise, bright light, extreme 
temperatures, physical blows and pinches, 
and strange objects, they are greatly out- 
numbered by the literature on electric shock, 
Therefore, the first general prescription for 
future work in this area is: compare TO with 
a variety of other aversive stimuli in as 
many different experimental paradigms as 
possible. Shock may not be as representative 
of the class of aversive stimuli as is generally 
assumed ; and some paradigms may yield 
similarities between stimuli, while others 
yield differences, 

Although it can be said that in general TO 
tenn the major criteria of an aversive 
one S, the results of the reviewed experi- 

nts are by no means definitive. The ex- 
perimental evidence presents a complex pic- 
ture of the aversive properties of TO from 
positive reinforcement. However, complexity 
ls not automatically cause for despair. It is 
reliability and validity that matter, not 
Simplicity, The major findings of this review 
Were: (a) The design of studies demon- 
Strating avoidance and escape from TO allow 
another interpretation than that TO is aver- 
Šive, Avoidance and escape behavior could 

ave been maintained because they led to a 

| 8teater positive reinforcement frequency over 


a 


a given time period than did the absence of 
such behavior. (6) A similar problem of 
interpretation was inherent in some of the 
punishment studies. The other punishment 
studies, however, provide reasonably ade- 
quate evidence that TO is an effective pun- 
ishing stimulus, that is, it suppresses behavior 
which produces it; and that experimental 
extinction of a formerly reinforced response 
could be considered a special case of punish- 
ment. Although the procedures and effects of 
punishment and extinction were shown to 
have much in common, much more experi- 
mental work and theoretical analysis needs 
to be done before a final conclusion can be 
reached. (c) The most convincing evidence 
that TO is aversive comes from those studies 
demonstrating escape from stimuli which 
previously set the occasion for nonreinforce- 
ment. (d) Evidence implying differences be- 
tween TO and other aversive stimuli include 
those studies which show that ongoing posi- 
tively reinforced behavior is suppressed in the 
presence of stimuli preceding shock, but is 
accelerated in the presence of stimuli preced- 
ing TO from positive reinforcement. This 
difference was assumed to be a function of 
two factors: since pre-TO visual stimuli ac- 
quire less aversive properties than pre- 
shock stimuli, Ss are less likely to turn away 
from them; and stimuli preceding TO elicited 
“conditioned excitement” which acted upon 
the prevailing key-pecking habit to cause 
accelerated response rates. (e) Studies illus- 
trating self-imposed interruptions from high 
fixed-ratio schedules of positive reinforcement 
indicate that, depending on the nature of the 
schedule, TO could be either a positive or 
negative reinforcer. 

In addition to analyzing implications of 
existing results, this paper also advanced 
specific proposals for new research. In gen- 
eral, future research should be directed 
mainly towards finding independent variables 
having a functional relationship with the 
aversive properties of TO. It should not be 
directed solely towards giving yes or no 
answers to the question: does TO have 
aversive properties? 

Continuing research in this area should 
have great influence on theories of extinction, 


discrimination learning, and delay of rein- 
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forcement. In addition, increased experi- 
mental evidence relating to the punishing and 
negative reinforcing properties of removing 
reward, delaying reward, preventing work for 
reward, blocking consummatory behavior, 
and even reducing magnitude of reward 
should be of value to all those interested in 
the aversive control and modification of be- 
havior. If Solomon (1964) is correct in say- 
ing that the area of aversive control has 
been relatively neglected, it is also correct to 
say that these TO variables have been 
severely neglected in our research activity— 
a neglect which makes no sense in terms of 
theoretical relevance or in terms of practical 
use in our society. 
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THREE-MODE FACTOR ANALYSIS* 
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2 studies employing Tucker’s 3-mode fact 


or analysis are reported. The Ist is an 


analysis of semantic differential data. 4 scale factors and 4 concept factors were 


obtained. 1 subject-type was obtained. It 
scale factors to the concept factors. The 2nd 
of Anxiousness. 3 situation factors and 3 
types were obtained, each consisting of a 3 
to the situation factors. 


Classical factor analytic procedures are ap- 
plicable to two-way classification data; for 
example, to matrices of subjects by tests. 
Assuming that the data have already been 
standardized properly, the data matrix can 
be factored into a product of two matrices: a 
factor score matrix and a factor loading 
matrix. However, psychological data are often 
three-way (or multi-way) classification data; 
for example, multi-trait multi-method matrices 
(Campbell & Fiske, 1959), the semantic differ- 
ential (Osgood, Suci, & Tannenbaum, 1957), 
and the S-R Inventory of Anxiousness (Endler, 
Hunt, & Rosenstein, 1962). 

The problem of analyzing multi-mode data 
(i.e., multi-way classification tables) has been 
discussed in the literature frequently (Abel- 
son, 1960; Burt, 1955; Cattel, 1952; Guttman, 
1958; Mahmoud, 1955). Yet all the Proposed 
solutions are based on some sort of reduction 
of the data to a two-way table. The semantic 
differential is frequently reduced to a concept 
by scale table, averaging over subjects. When- 
ever individual differences are considered im- 
portant, and the mode of “sub jects” 
eliminated, the items are summed 
aged) over the “unimportant” 
example, d 


cannot be 
(or aver- 
mode; for 
ata given in a table of subjects by 
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consisted of a core matrix linking the 
study is an analysis of an S-R Inventory 
response factors were obtained. Also, 3 
X 3 matrix linking the response factors 


tests by occasions can be reduced by averaging 
over occasions. The S-R Inventory of od 
ness (Endler, Hunt, & Rosenstein, 1962) 
which consists of responses to situations by 
subjects, can be reduced to a response K 
subject table, averaging each response oe 
all situations. The latter is an example net 
the asymmetrical treatment, involved in moe 
reduction procedures, is not tenable. poen 
in their analysis of the S-R Inventory, Bp a 
et al. have perceived this point and n 
treated the data symmetrically. Summing si a 
ations, they obtained a subject by respa a 
matrix. Summing responses, they obtaine te 
subject by situation matrix. They also pore ie 
out the problem of the relation among, es 
factors obtained from each of these mater 
respectively, is of 

The problem of a simultaneous analy jon 
all the modes of the data and of the o in 
among the corresponding factors is the 7 
issue of three-mode factor analysis. 


Tue Loic or THreE-MoDE 


FACTOR ANALYSIS de 


-m0' 

The mathematical problem of thee? A R 
factor analysis has been solved by ection 
Tucker? (Tucker, 1964). The following ogie”! 
is an exposition of the structure and ‘°° 
meaning of this method. 

Three-mode factor analysis is not 
forward generalization of classical 
factor analysis. Two-mode factor a03? p be 
quires some modification before 1 frst t 
generalized. It is, therefore, simple" patti 

vay OP th 
resented a" 
1 si y 


ý 


* Implication of factor analysis of thr 
for measurement of change. Paper Pt 3 
Conference in Measuring Change, UD 
consin, May 1962. 


at is 
of 


á> 


hi 


THREE-MODE FACTOR ANALYSIS 


introduce a modified version of two-mode 
factor analysis and explain three-mode factor 
analysis as a straightforward generalization. 
Besides, this modfication also has intrinsic 
interest in the two-mode case. 


-! Modification of Two-Mode Factor Analysis 


Let us reduce the semantic differential data 
(subjects by scales by concepts) to a two-way 
classification averaging over subjects. We ob- 
tain a scales by concepts matrix, X, of order 
iX k and of rank P(p Si, p< k). Let us 
assume that the scores have already been 
Standardized with regard to origin and unit 
of measurement, so that sum of products can 
be used instead of correlations. With these 
assumptions, the Eckart-Young procedure can 
be employed (Eckart & Young, 1936). Accord- 
ing to this procedure ;X¥} can be factored as 
a product of three matrices: 


Nk = UiAcV i, [1] 

where U = the orthonormal latent vector 
matrix of XXT, 

V = the orthonormal latent vector 
matrix of XTX, 

A =a rectangular matrix with 0 en- 


tries off diagonally, p nonzero 
entries (A, +--+ Ap) in the diagonal 
starting with the upper left corner, 
andA?(j=1--- $) are the latent 
roots of XXT (or of XTX). With- 
out loss of generality it may be 
assumed that the à; are arranged 
in decreasing order of magnitude. 


If the m largest values of A are chosen, then 


iX, = UmAmnV y [2] 


is a least Square approximation to X for m 
factors. This can be reformulated so that i 
and V become principal-axes factor loadi g 
matrices: arg 


X 


ll 


(UA)A“(AV) 


= Üa? 5 [3] 


U and V are the factor loading matrices for 
Scales and concepts, respectively. The next 
Step is to introduce a rotation to simple struc- 
ture. Due to the logical symmetry of concepts 
and scales we have to rotate both U and V. 
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Let Č be rotated by T, and F be rotated 
by P 


¥ = (C1)TA PA (PV). [4] 


Defining 
A=UT, B=PŪ, G 


rp, [5] 


we obtain 


[6] 


The factor analytic solution is now a product 
of three matrices instead of the usual solution, 
XV = AB. This seems to be a loss of parsimony, 
but this loss is compensated by a gain: both 
scales and concepts are treated symmetri- 
cally. We have a factor loading matrix for 
scales and a factor loading matrix for con- 
cepts. And both matrices have been trans- 
formed to some especially meaningful form 
such as simple structure. 

In the formula AGB the two matrices 1 
and B have a symmetrical role and both are 
analogous to factor loadings, that is, they can 
be explained as weights applied to factor 
scores. Therefore, G functions as a factor 
Score matrix. But, unlike the usual factor 
analytic formula for the factor scores, the 
matrix G is reduced with respect to both rows 
and columns. To return to the semantic differ- 
ential example: both scales and concepts are 
reduced to factors, and the internal core matrix 
G gives the factor scores of concept factors on 
scale factors. 

A test by subjects paradigm may clarify 
the issue in more familiar terms, though the 
requirement for symmetry is not as cogent 
here. It is also convenient to use in this con- 
text the terminology “idealized entities,” like 
“idealized subjects,” and “idealized traits,” 
(Helm & Tucker, 1962, p. 439; Tucker & 
Messick, 1960, p. 5). 

In the fundamental equation of classical 
factor analysis, 

[7] 


Xk = ilmBe, 

iXx is a matrix of subjects by tests, ¿dm is a 
matrix of subjects by factors, and mB, is a 
matrix of factors by tests. In the matrix Bz, 
each row can be interpreted as the entries of 
an “idealized subject.” Employing this termi- 
nology, each idealized subject is a k-tuple 
consisting of the loadings of the k tests on a 
factor. 


Ï = AGB. 


io 


i» 


Fig. 1 


Sectioning matrix B by rows we obtain: 
(Eji, Xj +++ Win) = Gji (bn, bie +++ 
+ aj2(bo1, bog +++ 


+ djm(Bm1, Ome + 


The row (x); is a vector of predicted scores for 
a given subject j on the k tests. This row- 
vector is a weighted sum of the rows of B. 
The rows of B are the scores of the set of 
idealized subjects, and the row vector (a); is 
a row of weights for subject j. 

Now we have to modify this interpretation 
in order to adapt it to two-mode factor an- 


alysis with an inner core matrix. In the 
formula 


Xp = iAmGmBy [9] 


the roles of subjects and tests are symmetric, 
mGm is a matrix of scores of idealized subjects 
on idealized tests. The rows of G are for ideal- 
ized subjects and the columns are for idealized 
tests. The A and B matrices are weights. Each 
row of the matrix ;Am is a set of weights for 
a given subject. Applying these weights to 
the idealized subjects (i.e., to the rows of G) 
we obtain the scores of the subject expressed 


ee 
q ince, 
TE 


Fige 2 
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as a linear combination of the idealized ne 
jects. Similarly, each column of Bz is 4 = 
of weights for a given test. Applying thes 
weights to the idealized tests (i.e., the columns 
of G) we obtain the scores on the test expresse" 
as a linear combination of the idealized tests. 
Obviously, the pre- and postmultiplication ol 
G by A and B produces the scores (or P 
dicted scores if nonsignificant factors 7 g 
omitted) of all subjects on all tests ae 5 
Scores of the idealized subjects on the 160% 
ized tests, 


An Extension to Three-Mode Factor Analys's 


P rec- 
We can extend this formulation to “ 


mode factor analysis. The fundamenta 
mula of three-mode factor analysis is: 


p10] 
Rij = DLL dimbjpCeakmrr 
m p a 


f with 
The three-mode factor analysis starts, hese 


data given in a three-way classification- which 
data are reduced to an inner core Gmpa 


e i three 
1S again of three-way classification, 22" “rhe 
factor weight matrices, Am Bp 1C" re 


inner core, G, is analogous to the inner ¢ a 
the equation X = AGB. The A, B Joa 
matrices are analogous to the factor : 
matrices. These relations are on oe ngle 
grammatically in terms of boxes and re ed intg 
in Figure 1, The box G can be section’, 2 
q frontal matrices of order mX $ $ jed PY 
in Figure 2, Each of these is premul ing oe 
A and postmultiplied by B. The res“ yatti? 
can now be sectioned laterally into can P3, 
of order i X q each. Each of these rig? 
be postmultiplied by C as show? m 
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These operations reproduce the box Y of 
order i Xj X k. The multiplication of Gre 
by the three linear operators 1, B, and C, can 
be performed in any order; that is, we can 
start with multiplication of G by -1, then by 
B, and finally by C. We can also start with 
C, then multiply by B, and finally by A, and 
so on. This follows from the commutativity 
of multiplication and of summation in the 
fundamental formula, 10. 

Now the generalization of two-mode factor 
analysis becomes obvious. Let us return to 
the example of the semantic differential, The 
data are given as concepts by scales by sub- 
Jects (raters). We obtain three weight matrices 
(or factor loading matrices): one for concepts, 
one for scales, and one for subjects. This 
establishes three sets of factors, Then, we 
get an inner core “box.” In analogy to the 
two-mode case, this gives the interrelations 
that connect the three sets of factors, that is, 
the factor score of each concept factor on each 
scale factor for each subject factor. 

Employing the terminology of idealized 
entities, the inner core box gives the scores 
of each idealized concept on each idealized 
scale for each idealized subject; and the three 
Outer matrices are weights applied to these 
idealized entities needed to reproduce the 
original scores of subject per scale per concept. 


THE SEMANTIC DIFFERENTIAL 
Data 


This study is a reanalysis of semantic differ- 
ential data collected by Edward Ware (1958). 
There are 31 concepts rated on 20 scales by 60 
a The Subjects are high-school students 
aie ia according to the scheme indicated in 

The scales had seven categories (1-7) and 
were presented in graphical format. 


Fig. 3 
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TABLE 1 


DISTRIBUTION oF SUBJECTS IN THE SEMANTIC 
DIFFERENTIAL Stupy By SEX AND IQ 


Male Female Totals 
High IQ 15 15 30 
Low IQ 15 15 30 
Total 30 30 60 
Method 


Scoring. The data were rescored from a 1 to 7 scale 
to a +3 to —3 scale. Assuming that 0 (the scale 
midpoint) is a natural neutrality point, and assuming 
that differences in size of unit (or standard deviation) 
reflect individual differences that ought to be retained 
in the analysis, no additional standardization seems to 
be required. The scores were, therefore, treated as 
standard scores, and sums of products were employed 
throughout, instead of correlations. 

There are two ways to convert the data to standard 
scores that can be suggested as intuitively meaningful. 
One method is standardization of each scale as a vari- 
able, pooled over concepts and subjects. The other one 
is a separate standardization of each subject, which is 
essentially a conversion to ipsative scores. The effects 
of the mean and the unit of measurement have to be 
discussed separately. 

The mathematical formulas expressing the effects 
of elimination of the mean (or any additive constant) 
are quite cumbersome. Fortunately, the deviations of 
the mean of each scale from zero were negligible so that 
this issue did not have practical significance. 

On the other hand, the effect of the unit of measure- 
ment is quite simple. It follows from Formula 10 
that a multiplicative constant can be incorporated in 
any of the matrices 4, B, C depending on its subscripts 
(i.e., on the method of standardization). As shown in 
the next section, the score matrices of each subject 
are proportional; and by a conversion of each subject 
to a standard scale, we would lose some interesting 
information. 


Factor Loading Matrices. Let i be the subscript for 
subjects, 7 for concepts, k for scales; i’, j’, and &’ are 
used as alternative subscripts to i, j, and k. Three 
matrices of sums of products were computed: 


DZ Tijkie [11] 
ae 
D E xijerizre, [12] 
ó k 
DD Xijttijk C13] 
7% 


These matrices were analysed by the method of prin- 
cipal axes and rotated to simple structure. The result- 
ing factor loading matrices are the matrices 4 B 
and C of Equation 10. The inner core is ae 
least square solution4 (Levin, 1963), 


iL. R Tucker, Analysis procedures in three-mode 
factor analysis, in preparation. 


obtained as a 
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TABLE 2 
Four SCALE FAcToRS—VARIMAX ROTATION 


Factor Factor Factor Factor 


Scales z Il Ill IV 
1. Fast-Slow —05 585 10 15 
2. Even-Uneven 06 06 67 03 
3. Strong-Weak 22 66 00 —02 
4. Good-Bad óle —Ol 462 —08 
5. New-Old 06 —12 24 10 
6. Warm-Cool 05 02 04 54» 
7. Moral-Immoral 525 11 11 —01 
8. True-False 56°» 20 —03 —06 
9. Predictable-Unpre- 02 —05 61s —16 
dictable 
10. Sociable-Unsociable 408 12 438 09 
11. Hot-Cold —06 O+ 03 528 


12, Hard-Soft —4l* 46% —13 


13. Fair-Unfair 60 —09 —15 03 
14. Usual-Unusual 66! 05 —13 —04 
15. Kind-Cruel 565 —17 22 07 
16. Straight-Twisted 16 —05 28° 04 


17. Powerful-Powerless 24 55e —23 08 
18. Colorful-Colorless 33a 09 19 19 


19. Simple-Complex 16 —25 05 —23 
20. Emotional-Unemo- 14 28" —32s 388 
tional 


a Salient loadings. 


Results 


Number of Factors. The number of factors 
has been determined, primarily, on basis of 
the latent roots. If we plot the latent roots in 
decreasing order of magnitude we find that the 
roots drop abruptly at the beginning, then 
taper off slowly. The bending point is used as 
a cutting point to determine the number of 
factors. 

There is, apparently, only one major factor 
for subjects. The latent roots drop abruptly 
after the first root: 45,900; 5,500; 4,460; 
3,650; 3,470; 3,270; 3,110; 2,290; etc. Cor- 
respondingly, only one idealized subject is 
retained in the inner core. Consequently, for 
these data, the predicted scores of each subject 
are computed by multiplication of the idealized 
subject by a constant of proportionality. Em- 
ploying a somewhat different procedure, Ed- 
ward E. Ware arrived at the same conclusion.5 

Four factors have been retained for con- 
cepts. The latent roots are 26,890; 22,520; 
9,470; 7,440; 4,810; 4,340; 3,870; 3,800; etc. 
From the fourth root onward the roots taper 

off gradually and slowly. 


5 Personal communication, 


The results for scales are not so clear-cut. 
The latent roots are 31,090; 18,140; 10,910; 
8,920; 7,670; 6,510; 5,990; 5,760; etc. 

There are either four or five factors. Con- 
siderations of meaningful interpretation sup- 
port the decision to retain four factors for 
scales. The pattern obtained from a rota- 
tion of four scale factors provides a meaning- 
ful interpretation, and the factors can be 
identified in terms of factors commonly ob- 
tained in semantic differential studies. The 
pattern obtained by rotation of five scale 
factors was ambiguous. 

Corresponding to the number of factors 


TABLE 3 
Four Concerr FACTORS—VARIMAX ROTATION 


Factor Factor Factor Factor 


Concepts I 1I MIL IV 
1. Food 38" —07 12 13 
2. Baby 52" —08 15 —20 
3. Pig 07 a ä 4 œ 
4. Grief OL 438 o0 —02 
5. Silk 21-12 3 0 
6. Snow bi 02 43° ol 
7. Bottom =p 12 14 18 
8. Sickness —02 50» 04 -12 
9. Mother 64) -i4 o f 
10. Army 20 15 —05 45° 
11. Job —23 
08 22 21 
12. Jelly 15 —10 43" a 
13. Street 14 10 17 aan 
14. Fear 03 530 01 13 
15. Anger 04 54s —15 14 
16. Insane man —08 50" —06 ae 
17. Me 458 —05 00 A 
18. Cop 22 13 —O+ 36 
19. Sin ~10 55a —09 u 
20. Sleep 40a —07 25 1 
21. Pain 09 
Pai 04 49 —06 
22. Spider -07 33» 16 o 
23. Butter t = 12 44° nn 
24. Lamp 23 —06 15 i 
25. Dream 34" 15 12 3 
26. Chair 2 o% 
18 —13 
27. Snail 02 02 39° o% 
28. Sex 50 15 0l 40 
29. Dirt <02 14 36 49° 
30. Statue o —04 2 
18 o 
31. Beggar =0 o §& ae 
— al 


8 Salient loadings. 


| 
h 
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retained for subjects, concepts, and scales, 
the inner core is of order 1 X 4+ X 4. 


Factor Loading Matrices and Inter pretation 
of Factors. The factor matrices are given in 
Tables 2 and 3. Only the results of Varimax 
rotation (Harman, 1960, ch. 14) are reported 
here. The results of an oblique rotation were 
essentially the same. 

The factors can be identified on the basis of 
salient variables. The scale factors are essen- 
tially similar to the major factors regularly 
obtained in semantic differential studies. : 

The first factor has high loadings on the 
following scales: 


14. Usual-Unusual 66 
4. Good-Bad 61 
13. Fair-Unfair 60 
8. True-False 56 
15. Kind-Cruel 56 
7. Moral-Immoral 52 


This is the Evaluation factor. 


The second factor has high loadings on the 
following scales: 


3. Strong-Weak 66 
1. Fast-Slow 58 
17. Powerful-Powerless 55 


12. Hard-Soft 46 


This is a coalescence of the Potency and Ac- 
tivity factors. It was also obtained in a study 
by Triandis and Osgood (1958) and was called 
a Dynamism factor. 

The third factor has high loadings on these 
scales: 


2. Even-Uneven 67 


9. Predictable. Unpredi 
able-Unpredicta 
4. Good-Bad predictable 61 


. ) 46 
10. Sociable-Unsociable 43 
20. Emotional-Unemotional —32 
and possibly, 
16. Straight-Twisted 28 


This factor seems to come closest to a Sta- 
bility factor. 

The fourth factor has few high loadings, 
It is high on these scales: 


6. Warm-Cool 54 


11. Hot-Cold 52 
20. Emotional-Unemotional 38 
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Scales 6, 11, and 20 are virtually synonymous, 
especially when one considers their frequent 
metaphorical use. This factor may be called 
a Warmth factor. While the identification of 
this factor seems logically reasonable on basis 
of the data obtained here, it has not appeared 
as a separate factor in other studies of the 
semantic differential. 

Since factor analytic studies of the semantic 
differential report scale factors only, the iden- 
tification and interpretation of concept factors 
is based entirely on the results obtained in 
this study. 

The first factor has high loadings on the 
following concepts: 


9. Mother 6+ 
2. Baby 52 
28. Sex 50 
17. Me 45 
and possibly, 
20. Sleep 40 
1. Food 38 
25. Dream 34 


These concepts are apparently related to 
family and home surroundings and may be 
called a Human factor or perhaps an Intimacy 
factor. 

The second factor is high on these concepts: 


19. Sin 55 
15. Anger 54 
14. Fear 53 
8. Sickness 50 
16. Insane man 50 
21. Pain 49 
4. Grief 43 


These concepts signify intense emotional states 
related to agitated behavior. This may be 
called an Agitation factor. 

The third factor has high loadings on the 
following concepts: 


23. Butter 44 
12. Jelly 43 
6. Snow 43 
27. Snail 39 
29. Dirt 36 


and possibly, 


5. Silk 30 
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TABLE 4 
Core MATRIX CORRESPONDING TO THE 
VARDIAX ROTATION 
Scale Factors 

Concept Evaluation Dynamism Stability Warmth 

factors I II IL IV 
Intimacy I 18 03 —01 18 
Agitation II —03 10 —19 03 
Slickness III 12 —13 —06 =A5 
Toughness IV 03 21 09 —14 


These concepts signify soft, compressible 
and slick matter of shapeless and malleable 
material. This factor may be called a Slickness 
factor. 

The fourth factor has high loadings on these 
concepts: 


10. Army 45 

30. Statue 39 

18. Cop 36 

13. Street 35 

24. Lamp 33 
and possibly, 

2. Baby =20 


The highest loadings on the fourth factor are 
somewhat lower than those of the three pre- 
ceding factors. It seems that they have in 
common a property of Toughness, whether 
tactile, such as in Statue, or metaphorical, 


such as in Army. Note also the i 
loading of Baby. iad 


The Core Matrix. The core matrix is gi 
eee matrix is given 


In order to facilitate interpretation, a 
categorization based on rank order is adopted 


TABLE 5 


Core MATRIX CORRESPONDING TO VARIMAX Roration 
(SCHEMATIC REPRESENTATION) 


Scale Factors 


Concept Evaluation Dynamism Stabilit: 

Factors I IL Tie y u 
Intimacy I ++ 0 0 res 
Agitation II 0 F ie o 


Slickness II + = 
Toughness IV 0 TE 3: __ 
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here, namely, the lowest five entries, the high- 
est five entries, and the middle six entries. 
In Table 5 this corresponds to the following 
cutting points: 


High: |15| and above 
Medium: between|03| and |15| 
Low: [03| and below 


The core matrices can be represented sche- 
matically employing the following notation: 


++ for high positive 
+ for medium positive | 
0 for low or neutral 
= for medium negative 
for high negative 


The schematic representation is given in 
Table 5. 

The scores of the Agitation factor require 
a few comments. A similar relation between 
the concept “Mental disorder” and stability 
scales is reported by Nunally (1961). On the 
other hand, the neutral (or rather low negative) 
score of Agitation on the Evaluation factor 
defies explanation. An attempt to employ 
Oblimax rotation produced a shift downwards 
but not large enough to pull it out of the — 
neutrality range. \ 

The other entries of the table need no 
further comment. 


Tue S-R INVENTORY OF ANXIOUSNESS 
Data 


This study is a reanalysis of the data em- 
ployed by Endler, Hunt, and Rosenstein 
(1962). This inventory consists of 14 responses 
to 11 situations, Each response is matched with 


each situation, The total number of items 
is 154, 


—— 


an 
is based om 


The choice of the situations . - - s that 


intuitive attempt to select a variety of situation 
would be familiar through either direct OF vic ss 
experience to most college freshmen OF sophomeily 
(the Ss), a variety which would vary from the tyP 
innocuous to the quite threatening. ait 
The S-R inventory employs a five-step scale, vege 
from none to very much, with which S is asked p rh 
the felt intensity of his own response to the $ 
in question [Endler et al., 1962, p- 5]. 
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Here is an example of an item: 


You are about to go on a roller coaster. 
Heart beats faster 12345 


Not at all Much faster 


The subjects were 169 male and female 
students in elementary courses in psychology 
at Pennsylvania State University. 


Standardization of Scores. Unlike the se- 
mantic differential, it would be untenable to 
assume here that the midpoint of the scale of 
each item corresponds to the origin, neither 
can we assume that the units of measurement 
are equal. 

The scores were converted to standard 
scores by the usual procedure of subtraction 
of the mean and division by the standard 
deviation, Note that there are 11 X 14 = 154 
variables and there are 169 subjects for each 
variable. 

Computation of Faclor Loadings and Core. 
Though based on the same mathematical 
principles, a different method was employed 
here than with the semantic differential. The 
procedure provides another aspect of com- 
parison between three-mode factor analysis 
and some classical procedures (Mahmoud 
1955). 

Three correlation matrices were computed: 
(a) The intercorrelation of all items. This is 
a matrix of order 154. (b) The average inter- 
correlation of the situations. This is a matrix 
of order 11. (c) The average intercorrelation 
of the responses. This is a matrix of order 14 
Each of these matrices was factor analyzed by 
by a method of principal axes. 

e princi xi ' i 
(134 x ppb | as i pi e large matrix 
omen ployed to compute the 
5 be ie of the 154 items a situations 

y ponses) yielded three significant fac- 
tors. At this point, it seems that these facto: 
generate a confounding of two logically dis. 
tinct modes; for the loadings of each item 
involve both a situation and a response. How- 
ever, rearranging the loadings of each factor 
in the format of a matrix of 11 situations by 
14 responses, we obtain again a three-way 
Classification table consisting of factors by 
Situations by responses, thus keeping the dis- 
tinct modes neatly separated. This is the first 


s A ; 
< Step to obtain the inner core. The factors corre- 
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spond to the idealized subjects of our previous 
terminology. We still have two additional 
matrices: the principal axes of the situations 
and the principal axes of the responses. They 
can serve as linear operators to complete the 
computation of the inner core and to reduce 
the situations to idealized situations and the 
responses to idealized responses. The de- 
tailed procedure to obtain the inner core is 
discussed elsewhere (Levin, 1963; Tucker‘). 
The principal axes matrices of the situations 
and of the responses were rotated obliquely 
(Tucker, 1944; Tucker, 1955). The results 
obtained correspond to the matrices B and C 
of Equation 10. Due to methodological re- 
strictions, the matrix A corresponding to 
subject factors could not be computed. 


Results 


Number of Factors. Three factors were re- 
tained for situations and three factors for 
responses. 

The latent roots of the situation matrix are: 
3.54, .66, .47, .32, .28, .24, .13, .10, .08, .08, .04. 
The roots taper off slowly after the third root. 
The sum of the first three roots is 4.67. The 
first three roots account for 78% of the 
variance. 

The latent roots of the response matrix are: 
3.86, 1.55, .69, 38, .34, .28, .12, .10, .05, .03, 
00, —.03, —.05. The abrupt drop between 
the third and fourth root, and the slow taper- 
ing off from the fourth root onward is here 
more prominent. The sum of the first three 
roots is 6.10. The total sum of the roots is 
7.56. The first three roots account for 81% 
of the variance. 

The latent roots of 1s:Rıs4 are: 23.89, 7.39, 
6.37, 4.76, 4.28, 4.03, 3.50, 3.24, 2.95, 2.83, 
etc. There is a sharp drop between the first 
and second root, but there is also a relatively 
large decrease from the third to the fourth 
root. From the fourth root onward there is a 
slow gradual decline. This can be seen clearly 
from the differences between each pair of 
successive roots: 16.50, 1.02, 1.61, .48, 25; 53 
26, .29, 12, The order of magnitude of the 
first three differences is larger than that of the 
See ecole t maton 
ingly, the inner i i Laree tactors, Correspond- 

sY, er core is, thus, a 3 X 3 X 3 box. 
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TABLE 6 
Factor LOADINGS OF SITUATIONS— 
OBLIQUE ROTATION 
I Il ul 
1, Auto trip .01 OF 388 
2. New date 31» —.12 308 
3. Psychological experiment 00 .03 4s 
4, Ledge high on mountain — .06 568 OL 
side 
5. Speech before large group 353 —.10 .03 
6. Counseling Bureau for 20 O04 36" 
personal problems 
7. Sail boat on rough sea —.01 428 08 
8. Competitive contest 46% 09 —.14 
9. Alone in woods at night 07 AT —.06 
10. Interview for important 528 —.03 o4 
job 
11. Final exam in important ae .09 05 
course 


a Salient loadings. 


The Factor Loading Matrices. The rotated 
factor loadings are given in Tables 6 and 7. 
The salient variables of the first factor for 
situations are: 
5. Speech before large group 55 
10. Interview for important job 2 
8. Competitive contest 46 
11. Final exam in important course 33 
2. New date 31 


These are Situations where interpersonal stress 
relations are involved. This factor was called 


Interpersonal. 
TABLE 7 
Factor Loapincs OF RESPONSES—OBLIQUE ROTATION 


I IL Ill 
1. Heart beats faster 
2. Get “uneasy feeling” a a =e 
3. Emotions disrupt actions 468 —04 =i 
4, Feel exhilarated and AL “608 pe 
thrilled ee ae 
5. Want to avoid situation a 
6. Perspire Ek = = 
7. Need to urinate frequently .07 ‘09 oe 
8. Enjoy the challenge —.04 ‘Ti a 
9. Mouth gets dry al —03 "310 
10. Become immobilized -23 —.05 25 
11. Get fullfeelinginstomach 05 105 42 
12. Seek experiences like this —.05 13a ‘05 
13. Have loose bowels — 15 OL "558 
14, Experience nausea 02 ; 


a Salient loadings. 
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The salient variables of the second factor 
for situations are: 


4. Ledge high on mountain side .56 
9. Alone in woods at night AT 
7. Sail boat on rough sea 42 


These are situations of concern with inanimate 
personal danger. This factor was called 
Inanimate. 

. The salient variables for the third factor for 
situations are: 


3. Psychological experiment Jt 

1. Auto trip 38 

6. Counseling Bureau for personal 36 
problems 

2. New date 30 


The similarity of “Psychological experiment” 
and “Counseling Bureau” is obvious. A “New 
date” has a component of similarity to tinter- 
personal stress.” Yet its affinity to the antici- 
pation of a psychological experiment seems 
cogent. These items may, perhaps, be charac- 
terized as comprising situations of “intras 
personal stress.” The meaning of “Auto trip 
in this context is not clear, For brevity this 
factor may be labeled Psychological. 

The salient variables of the first factor for 
responses are: 


2. Get “uneasy feeling” 68 
1. Heart beats faster 63 
3. Emotions disrupt actions 46 
6. Perspire ae 
5. Want to avoid situation Jt 


This factor was called a Distress factor. = 
The salient variables of the second factor lor 
responses are: 


8. Enjoy the challenge a7 
12. Seek experiences like this AY 
4. Feel exhilarated and thrilled F 


5. Want to avoid situation 


This factor was called an Exhilaration meen 
The salient variables of the third factor for 
responses are: 


13. Have loose bowels Po 
7. Need to urinate frequently re 


11. Get full feeling in stomach rt 
14. Experience nausea A 
9. Mouth gets dry . 


This factor was called Autonomic. 


= — 
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The Core Matrix. The core is a three-mode 
matrix of order 3 X 3 X 3, that is, it consists 
of three idealized subjects or types. Each type 
is an interaction matrix of Situation Factors 
X Response Factors, that is, for each type we 
obtain the scores of response factors to the 
Situation factors. The core matrix is given in 
Table 8. It is arranged in Sections correspond- 
ing to types. Each section is a 3X 3 matrix 
with rows corresponding to response factors 
and columns to situation factors. Since the 
items were scored on a scale from “not at all” 
to “very much”; a low negative entry corre- 
sponds to “not at all,” a high positive entry 
Corresponds to “very much,” and a zero (or 
near zero) entry corresponds to the average 
(the scores were converted to standard scores 
at the first step of the computation). 

The first type is high in Distress response 
(40, 39, .22) and Autonomic response (41, 
37, -23), and low to low-average in Exhilara- 
tion (=05; Ll, .08). Comparing situations, 
we find a stronger reaction to the Inter- 
personal and Inanimate situations than to the 
Psychological situation. 

_ The second type is notably high in Exhilara- 

tion (.20, 44, 32), which is especially con- 
Spicuous in the Inanimate situation; but the 
Distress and Autonomic responses depend on 
the situation. The anxiousness responses to 
the Interpersonal factor are somewhere be- 
tween the Inanimate factor and the Psycho- 
logical factor: average Distress (—.07) and 
somewhat high Autonomic (.19). The Exhilara- 
tion response (.29) is almost the same as to 
the Psychological situation (.32). 

Comparing si ati re fi a very hi 
Exhilaration respo Cs) j on i ba high 
response (—.29) to the T era by 

oe nanimate situation. 
The Autonomic response is average (.05) but 
it is not necessarily an anxiousness response— 
it might be a seasickness type of reaction 
(note Situations 4 and 7). The response to the 
© Psychological situation is high in all factor 
~ Exhilaration (.32) and the anxiousness factors, 

at is, Distress and Autonomic (.20, 33). 

his conforms to the hypothesis in the mono- 
Staph by Endler et al. (p. 5) that excitement 

&Y come both in positive and negative forms. 

The third type has high Distress and Auto- 
tig e reactions (.30, .37) and low Eshilara- 
| N reaction (—.19) to the Inanimate situa- 


N 
i 
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TABLE 8 


Core MATRIX—OBLIQTE ROTATION 


Interpersonal Inanimate Psy chological 

Type I 

Distress 40 39 22 

Exhilaration —.05 =.11 03 

Autonomic Al 37 23 
Type IL 

Distress —.07 —.29 .20 

Exhilaration .29 oH 82 

Autonomic 19 05 33 
Type III 

Distress —.22 30 —.16 

Exhilaration 37 —.19 19 

Autonomic — 01 37 —wid 


tions, and conversely high Exhilaration G37; 
19) and low to low-average Distress and 
Autonomic reactions to the Interpersonal and 
Psychological situations (—.22, —.01, —.16, 
=.15),; 

In a schematic and somewhat simplified 
summary the three types can be described as 
follows: The first type is noted for general 
anxiousness reactions: Distress and Autonomic 
reactions, though he is slightly less sensitive 
to the Psychological situation. The second 
type shows a thrilling reaction, especially to 
physical adventure, and a tendency to get 
excited to the other situations, that is, to 
react both with some exhilaration and anxious- 
ness. The third type shows an anxiousness 
reaction to physical adventure and a thrilling 
reaction to the other situations, 


CONCLUSION 


The main contribution of three-mode factor 
analysis to classical factor analysis consists of 
the idea of an inner core and of the method to 
calculate its numerical values. The emperical 
studies reported here demonstrate that these 
values conform to a meaningful pattern. The 
inner core can be interpreted in terms of 
psychological types. Each type consists of a 
matrix which can be explained in terms of 
oe among the factors pertaining to 

a ar tw w. The amera Sa 
the other two modes. The Semantic differen- 
tial study demonstrates a Special case consist- 
Ing ol one type only 
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